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(57) Abstract 



The present invention relates to the three- 
dimensional structures of a protein tyrosine kinase 
optionally complexed with one or more compounds. 
The atomic coordinates that define the structures of 
the protein tyrosine kinase and any of the compounds 
bound to it are pertinent to methods for determining 
the three-dimensional structures of protein tyrosine 
kinases with unknown structure and to methods 
that identify modulators of protein tyrosine kinase 
functions. 
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DESCRIPTION 

CRYSTAL STRUCTURES OF A PROTEIN TYROSINE KINASE 

5 RELATED APPLICATIONS 

This application is related to U.S. Application 
Serial No. 08/701,191, by Mohammadi , et al . , entitled 
"Crystals of the Tyrosine Kinase Domain of Non- Insulin 
Receptor Tyrosine Kinases," filed August 21, 1996 (Lyon 

10 Sc Lyon Docket No. 227/088) and U.S. Application Serial 
No. 60/034,168, by McMahon , et al . , entitled "Crystal 
Structures of a Protein Tyrosine Kinase Complexed with 
Compounds of the Oxindolmone/Thiolindolmone Family," 
filed December 19, 1996 (Lyon & Lyon Docket No. 

15 221/282) , which are hereby incorporated herein by 

reference in their entirety including any drawings, 
tables , and figures . 

INTRODUCTION 

20 The present invention relates to the three 

dimensional structures of protein kinases. 

BACKGROUND O F THE INVENTION 
The following description of the background of the 
~ " invention is provided simply as an aid in understanding 
the invention and is not admitted to describe or 
constitute prior art to the invention. 

Protein tyrosine kinases (PTKs) comprise a large 
and diverse class of enzymes (for a review, see 
30 Schlessinger and Ullrich, 1992, Neuron 9: 383-391). The 
PTK family contains multiple subfamilies, one of which 
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* * "~ ~ ' ^ ^ "Sr.zyT.a!! ica^ -Tar.sisr a h i cr h 6r.s r c:v 
pnospnate from, adenosine triphosphate to a tyrosine 
residue in a target protein. These phosphorylation 
events regulate cellular phenomena m signal 
transduction processes. Cellular signal transduction 
processes contain multiple steps that convert an 
extracellular signal into an intracellular signal. The 
intracellular signal is then converted into a cellular 
response. PTKs are components m many signal 
transduction processes. A ?TK regulates the flow of a 
signal m a particular step m the process by 
pnosphoryiatmg a downstream molecule. The addition of 
a phosphate can either modulate the activity of the 
downstream molecule by turning it "on" or "off". Thus, 
aberrations in a particular FTK's activity can either 
cause overflow or underflow of the signal. Overflow of 
a signal can lead to such abnormalities as uncontrolled 
cell proliferation; which is representative of such 
disorders as cancer and angiogenesis . 

Scientists in the biomedical community are 
searching for PTK inhibitors that down- regulate overflow 
25 signal transduction pathways. In particular, small 

molecule PTK inhibitors are sought that can traverse the 
cell mem.brane and not become hydroiyzed m acidic 
environments. These small molecule PTK inhibitors can 
oe highly bioavailabie and can be administered orally to 
3 0 oat lent s . 



= ome small 



)~ecule PiK inhibitors have already 
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oeen discovered. For example , bis ■ monocyclic ; cicvcl: 
or heterocyclic aryl compounds (?CT WC 92'20542), 
vinylene-azamdoie derivatives :?CT WC 94 1430S: , 1- 
cyciopropyl-4 -pyridyi -quinolones ;".S. Patent No. 
5 5,330,992), styryl compounds (U . S . Patent No . 

5,217,999), styryl-suostituted pyridyi compounds (U.S. 
Patent No. 5,302,606), certain qumazoline derivatives 
(EP Application No. 0 566 266 Al ) , seleomdoies and 
selenides (PCT WO 94/03427) , tricyclic polyhydroxylic 
10 compounds (PCT WO 92/21660) , and oenzy iphosphonic acid 

compounds (PCT WO 91/15495) are described as PTK 
inhibitors . 

Although many PTK inhibitors are known, many of 
these are not specific for PTK subfamilies and will 

15 therefore cause multiple side-effects as therapeutics. 
Compounds of the indolinone family, however, are 
specific for the FGFR subfamily and are non- 
hydrolyzable. WO 96/4 0116, "Indolinone Compounds for 
the Treatment of Disease," published December 19, 1996, 

20 inventors Tang et al . Although the use of X-ray 

crystallography has provided three dimensional 
structures of other PTKs, they are not complexed with 
PTK subfamily specific, hydrolysis resistant, small 
molecules . 

Respite recent advances, the need remains in tne 
art for crystal iographic analysis of protein kinases, s; 
that improved therapeutic molecules can be designed and 
synthesized . 

30 SUMMARY OF THE INVENTION 

The present invention relates to tne three 
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c i . ; . e :~ s i c r a _ siruccur^ o ~ r - - 0 - - , , ^ , ~ > ^ ~ - 
a:cn:: resolution. 

Tne three dimensional structures described herein 
elucidate specific interactions between crctem tvrosme 
kinases and compounds bound to them. The coordinates 
chat define the three dimensional structures of protein 
tyrosine Kinases are useful for determining three 
dimensional structures of ?TKs with unknown structure. 
In addition, the coordinates are alsc useful for 
designing and identifying modulators of protein tyrosine 
kinase function. These modulators are potentially 
useful as therapeutics for diseases, including (but 
limited to; cell proliferative diseases, such as cancer, 
angiogenesis , atherosclerosis, and arthritis. 

Thus in a first aspect, the invention features a 
crystalline form of a polypeptide corresponding to the 
catalytic domain of a protein tyrosine kinase. 

The term "crystalline form," m the context of the 
invention, is a crystal formed from an aqueous solution 
comprising a purified polypeptide corresponding to tne 
catalytic domain of a PTK. A crystalline form of a 
protein tyrosine kinase is characterized as being 
25 capable of diffracting x-rays in a pattern defined by 
one of the crystal forms depicted in Blundel et al . , 
1976 > Protein Crystal lography . Academic Press. A 
crystalline form of a protein kinase is not 
characterized as being capable of diffracting x-rays in 
30 a pattern analogous to a crystalline forrr. consisting of 
prixanr/ sa^t or primarily a compound, for examole . 



20 
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The term "proieii: cyrosme Kinase," or PTK, refers 
:c an er.zyr.e chat transfers the high energy phosphate o r 
adenosine triphosphate to a tyrosine residue located on 
a protein target . 
5 A protein tyrosine kinase catalytic domain of the 

invention can originate from receptor protein tyrosine 
kinases that bind fibroblast growth factor (FGF) . These 
protein tyrosine kinases are known as n FGFR" herein, and 
can relate to one member of the FGFR family, such as 
10 FGFR1 . 

The term "catalytic domain" refers to the region of 
a protein that can exist as a separate entity from the 
protein. The catalytic domain of a protein tyrosine 
kinase is characterized as having considerable amino 

15 acid identity to the catalytic domain of other protein 
tyrosine kinases. Considerable amino acid identity 
preferably refers to at least 30% identity, more 
preferably at least 35% identity, and most preferably at 
least 40% identity. These degrees of amino acid 

20 identity refer to the identity between different protein 
tyrosine kinase families. Amino acid identity for 
members of a given protein tyrosine kinase family range 
from 55% to 90%. The catalytic domain may be functional 

^ _ _ ^ — >f- ^ _„ + _^ + _, J . t»V> cs h is 1 wf- -i /— A r^m 3i n r> -f a nrnf pi n 

as a scpa^auc chl^l) , nit ^cx^dxj^^^ ^ ^ 1 L ^ ^ - ^ w — ^ ^ ^ - — 

25 tyrosine kinase is also characterized as a polypeptide 

that is soluble in solution . 

The term "identity" identity as used herein refers 

to a property of sequences that measures their 

similarity or relationship. Identity is measured by 
30 dividing the number of identical residues in the two 

sequences by the total number of residues and 
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sequences mat are less Highly "-served and have 
deletions, additions, or replacements nave a lower 
degree of identity. Those skilled m the art will 
recognize that several computer programs are available 
for determining sequence identity. 

The term "functional" refers to the abilitv of a 
catalytic domain to convert a substrate into a product 
oy phospncrylat mg the substrate. The term "functional" 
also relates to the ability of a catalytic domain to 
bind natural binding partners. The catalytic region 
may comprise an N-terminal tail, a catalytic core, and a 
l-termmal tail. The catalytic core is a polypeptide 
that can oe functional in terms of catalysis. x- and C- 
terminal tails are polypeptide regions that may not 
confer appreciable functionality m terms of catalysis, 
but may confer functionality in terms of modulator 
specificity . 

A polypeptide can exist as a catalytic domain 
eventhough it is not functional. For example, a 
polypeptide corresponding to a catalytic domain may not 
be functional if it does not harbor phosphate moieties 
cn key areas. Multiple examples of phosphorylation - 
25 state dependent function are well documented m the art. 
Therefore, a catalytic domain can also exist without 
oemg functional. A measure of a protein kinase 
catalytic domain is a polypeptide that is homologous to 
other protein kinase catalytic domains. 
30 The -^rm u poiypept ide" refers to an ammo acid 

chain representing a portion of, or the entire seauence 
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of, amino acids comprising a protein. 

A preferred embodiment of the invention includes a 
crystalline form of a PTK that is a receptor PTK . 
Receptors are proteins that straddle the inside and 
5 outside of the cell membrane. Receptor PTKs comprise an 
extracellular region, a transmembrane region, and an 
intracellular region comprising a catalytic domain. 

Another preferred embodiment of the invention is 
the crystalline form of a receptor PTK selected from the 
10 group consisting of FGF-R, PDGF-R, FLK , CCK4 , MET, TRKA , 

AXL, TIE, EPH, RYK , DDR, ROS , RET, LTK , R0R1 , and MUSK. 

Yet another preferred embodiment of the invention 
is the crystalline form of a PTK that is a non -receptor 
PTK. Non-receptor PTKs are located inside the cell and 
15 do not harbor extracellular or membrane - spanning 

polypeptides attached to the polypeptide corresponding 
to the catalytic domain. Non-receptor PTKs may harbor 
fatty acids or lipids, which can impart a membrane 
associated character to a PTK. In preferred embodiments 
20 of the invention, crystalline forms of non-receptor PTKs 
are selected from the group consisting of SRC, BRK, BTK, 
CSK, ABL, ZAP7 0 , FES, FAK, JAK, and ACK . 

In still another preferred embodiment, the 

iiiVCUtlUU LCauui CO r -A uj. ) buQj. — ^- »-* - - - . 

25 comprises a heavy metal atom. Tnese types of crystals 

can be referred to as derivative crystals. 

The term "derivative crystal" refers to a crystal 

where the polypeptide is in association with one or more 

heavy-metal atoms . 
30 The term "association" refers to a condition of 

proximity between a chemical entity or compound, or 
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ass3::a::cr. nay ce r.cr.-ccvaler.:., i.e., where the 
:ux:apos::icn is energetically favored bv , e.a., 
hydrogen-bonding, van der Waais, ele: 



■ ■ — s ^ d ■_ , 



hydrophobic interactions, or ;: may be covalent . 

The term "heavy metal atom" refers zc an atom ;h a - 
is a transition element, a lanthaniae metal, or an 
actmide metal. Lanthaniae metals include elements with 
atomic r.umoers between 57 and 71, inclusive. Actmide 
r.etals include elements with atomic numbers between 39 
and 103, inclusive. 

In a preferred embodiment, the invention features a 
crystal of an FG? receptor tyrosine kinase domain 
protein. The FGF receptor tyrosine kinase domain 
protein can relate to FGFR1 . 

The term " FGFR1 " refers to one member of multiple 
receptor PTKs that are homologous to one another ana 
bind FGF. in this context, the term "homologous" refers 
to at least 70% amino acid identity between two members 
of the FGFR family. 

The term. " FGFR1 " can also refer to a mutant of 
human FGFR1 which is characterized by the amino acid 
sequence of SEQ ID NO : 2 . As compared to human FGFR1 , 
FGFR1 contains the following ammo acid substitutions: 
Cys-48 8 - Ala, Cys-5 84 - Ser, Leu- 457 - Val , and has an 
additional five ammo acid residues at the N-terminus 
(Ser-Ala-Ala-Gly-Thr) . 

The term "human FGFR1 " refers to the tyrosine 
Kinase domain of human fibroblast growth factor receotor 
_ ; "FGFRl" ■ having the amino acid seauence of 
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NO : 1 . Generally, human FGFR1 comprises a 3 13 ammo acid 
residue fragment ^residues 456 :o 765; of human FGFR1 . 

The ceo "mutant" refers to a polypeptide which is 
obtained by replacing a: least one ammo acid residue m 
5 a native tyrosine kinase domain with a different ammo 
acid residue. Mutation can be accomplished by adding 
and/or deleting amino acid residues within the native 
polypeptide or at the N- and/or C-terminus of a 
polypeptide corresponding to a native tyrosine kinase 
10 domain having substantially the same three-dimensional 
structure as the native tyrosine kinase domain from 
which it is derived. By having substantially the same 
three-dimensional structure is meant having a set o t 
atomic structure coordinates that have a root mean 
15 square deviation (r.m.s.d.) of less than or equal to 
about 2 A when superimposed with the atomic structure 
coordinates of the native tyrosine kinase domain from 
which the mutant is derived when at least about 50% to 
100% of the Ca atoms of the native tyrosine kinase are 
20 included in the superposition. A mutant may have, but 
need not have, PTK activity. 

In another preferred embodiment, the invention 
relates to a crystalline form defined by the structural 
coordinates set forth in Table 1. 
25 The term "atomic structural coordinates*''' as used 

herein refers to a data set that defines the three 
dimensional structure of a molecule or molecules. 
Structural coordinates can be slightly modified and 
still render nearly identical three dimensional 
30 structures. A measure of a unique set of structural 
coordinates is the root -mean- square deviation of the 
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three Clir'i't^Sl'""^ >~ 0 — ~ ^ _ _ j G , - _^ _ - 

3 - :t:her C V = rcct-n.ean-square deviation of less :har. 1.5 
A -ay be viewed by a person of ordinary s>:ill in the an 
as identical. Hence, the structural coordinates set 
rortn m .ab.e 1, Taoie 2, Table 3, and Table 4 are not 
limited to the values defined therein. 

In other preferred embodiments, the invention 
features a crystalline form of the polypeptide in 
association with a compound. These types of crystalline 
forms can oe referred tc as co-crystals. The compound 
may be a cofactor, substrate, substrate analog, 
inhibitor, or alios t eric effector. 

The term "compound" refers to an organic molecule. 
The term "organic molecule" refers tc a molecule which 
has at least one carbon atom m its structure. Tne 
compound can have a molecular weight of less than 6kDa. 
Both the geometry of the compound and the interactions 
termed between tne compound and tne polypeptide 
preferably govern high affinity binding between the two 
molecules. High affinity binding is preferably governed 
by a dissociation equilibrium constant on the order of 
10" 6 M or less. The compound is preferably a modulator 
chat alters the function of a ?TK . 
2d The -erm "function," m reference to the effect of 

a modulator on ?TK function, refers to tne ability of a 
modulator to enhance or inhibit tne catalytic activity 
of a ?TK. 



20 



7 n 



The term "catalytic activity", in the context oi 
tne invention, defines the ability of a PTK to 
phosphoryiate a substrate polypeptide. Catalytic 
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activity can oe measured , f or exar.pl e , oy determining 
the amount of a substrate converted to a product as a 
function of time. The conversion of the substrate to a 
product occurs at tne active-site ct the PTK. 
5 The term "active - site" refers to a cavity located 

m the PTK m which one or more substrate molecules may 
bind. Addition of a modulator to cells expressing a PTK 
may enhance (activate) or lower (inhibit) the catalytic 
activity of the PTK . 

10 A small number of inhibitors of PTK catalytic 

activity are known in the art. Small molecule 
inhibitors may modulate PTK function by blocking the 
binding of substrates. Indclinone compounds, for 
example, may bind to the active-site of PTK catalytic 

15 domains and inhibit them effectively, as measured by 
inhibition constants on the order of 10~ 6 M or less. 

Apf 1 T /-3 t- o >~ Q ^ "f* "DTV i rapol 1 nl ar vn"fi nnc ^- ^ i-i £_ >-> r-i ^. o 

PTK function by interacting with both the PTK catalytic 
domain and the substrate. Activators may also promote 
2 0 dimerizat ion of PTKs and thus act i vat e them by bringing 
them into close proximity with one another. In 
addition, activators may operate by promoting a 
conformational change in the intracellular region of the 

n t v ■. i +-V-%->+- "U a *- 2 1 irt- i n -v- o ^ ^ r-i ^ ^ -! -P -! o ^ <— 1 1 K ^ +- ^ — 

r i i'. DUL.il »_iici.i_ lu<- v-auaxj' i Lh x^n uivjui j. ito oujjo li auco 

25 at a faster rate in the presence of the activator. 

The term "function" can also refer to the ability 
of a modulator to enhance or inhibit the association 
between a PTK and a natural binding partner. 

The term "natural binding partner" refers to a 
3 0 polypeptide that normally binds to a PTK in a cell . 
These natural binding partners can play a role in 
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binding constant on the or< 



- r\ a s~ ■ l 



- ess ■ However, a natural binding partner can ais: 



it. ?TK natural binding partners are chose- fret a 
group consisting of, cut not limited to, sre hcmclcgv 2 

S~2 ■■ or j -3H3 ■ domains, other phosphoryl tyrosine 
cmdmg -?TB; domains, nucleotide exchange factors, and 
other protein kinases or protein phosphatases. 

The term "interactions" refers to hydrophobic, 
aromatic, and ionic forces and hydrogen bonds formed 
between atoms in tne modulator and the enzyme active- 
site. 

The term "cofactor" refers to a compound that may, 
in addition to the substrate, bind to a protein and 
undergo a chemical reaction. Multiple co-factors are 
nucleotides or nucleotide derivatives, such as phosphate 
ana nicotinamide derivatives of adenosine. 

The term "substrate" refers to a compound that 
reacts with an enzyme. Enzymes can catalyze a specific 
reaction on a specific substrate. For example, ?TKs can 
phosphorylate specific protein and peptide substrates on 
tyrosine moieties. In addition, nucleotides can act as 
substrates for protein kinases. 

The term "substrate analog" refers to a compound 
tnat is structurally similar, but not identical, tc a 
substrate. The substrate analog may be a nucleotide 
analog. Examples of nucleotide analogs are described 
be low . 
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The t e rn u mhib: tier " refers tc a compound chat: 
decreases the cellular function cf a protein kinase. 
The protein kinase function is preferably the 
interaction with a natural binding partner and more 
5 preferably catalytic activity. 

The term "ailosteric effector" refers to a compound 
that causes allosteric interactions in a protein. The 
term u alIosteric interactions" refers to interactions 
between separate sites on a protein. The sites can be 
10 different from the active site. The allosteric effector 
can enhance or inhibit catalytic activity by binding to 
a site that may be different than the active site. 

The term "co-crystal" refers to a crystal where the 
polypeptide is in association with one or more 
15 compounds . 

In preferred embodiments, a co-crystal of the 
invention can be in association with a heavy metal atom. 
Examples of heavy metal atoms are described above. 
In other preferred embodiments, the invention 
20 features a co-crystal comprising the crystalline form of 
the polypeptide in association with a compound, where 
the compound is a non-hydrolyzable analog of ATP. These 
analogs can be referred to as nucleotide analogs. 

The term "ATP" refers to the chemical compound 
25 adenosine triphosphate. 

Tne term u non-hydroiyzable" refers to a compound 
having a covalent bond that does not readily react with 
water. Examples of non-hydrolyzable analogs of AT? are 
AMP-PNP and AMP- PC? , whose structures are well known to 
30 those skilled in the art. 

The term "AMP-PNP" refers to adenylyl 
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-icr.^bpncr.are, a ncn-hydrclyzacle analogue cf AT?. 

- t ar.zir.er preferred er.ccdi^er.: , toe i r ve^ r ; ^r 
rebates :c a crystalline fern defined by ere structural 
coordinates set forth in Table 2 . 

In preferred embodiment s , the invention relates to 
crystalline forms, where the compound m association 
with the polypeptide is an mdolmone. 

Certain mdolmones are specific modulators of ?T^ 
function. A preferred embodiment of the invention is 
the crystalline form of a PTK complexed with an 
mdolinone of formula I or II: 



15 
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or a pharmaceut ically acceptable salt, isomer, 

metabolite , ester, amide, or prodrug thereof, where: 

(a) A : , A 2 , A 3 , and A 4 are independently carbon or 
5 nitrogen ; 

(b) R : is hydrogen or alkyl; 

(c) R 2 is oxygen in the case of an oxindoiinone or 
sulfur in the case of a thioiindolinone ; 

(d) R 3 is hydrogen; 

10 (e) R 4 , R 5 , R 6 , and R 7 are optionally present, and 

are either (i) independently selected from the group 
consisting of alkyl, alkoxy, aryl, aryloxy, alkaryi, 
alkaryloxy, halogen, trihalomethyl , S(0)R, S0 2 NRR * , S0 3 R, 
SR, N0 2/ NRR ' , OH, CN, C(0)R, 0C(0)R, NHC(0)R, (CH 2 ) R C0 2 R, 

15 and CONRR 1 or (ii) any two adjacent R 4 , R 5/ R 6 , and R 7 



of the indole-based portion of the mdolmone; 

(f) R 2 ' , R 3 ' , R 4 ■ , R 5 1 , and R £ ' are each 
independently selected from the group consisting of 
20 hydrogen, alkyl, alkoxy, aryl, aryloxy, alkaryi, 

alkaryloxy, halogen, trihalomethyl, S(0)R, S0 2 NRR ' , S0 3 R, 
SR, NC 2 , NRR', OH, CN , C(0)R, 0C(0)R, NHC (0)R, (CH 2 ; n C0 2 R, 
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i} R ' 13 nydrcgen, a Iky I cr aryl; and 
■. : ; A is a rive T.encered he t e r c a ry 1 r:na s e 1 e c t e d 
frc- :r.e group consisting of tniophene, pyrrole, 
pyrazcle , imi iazole, 1 , 2 . 3 - onazcle , 1 , 2 , 4 - r riazoie , 
cxazole, iscxazcie, tniazcle, isothiazole, furan, 1,2,3 
oxadiazole, 1,2,4- oxadiazole, :,2 ( 5 -oxadiazole , 1,3,4- 
oxaoiazole , 1,2,3, 4 -oxarriazole , 1,2,3, 5 - oxatr la zole , 
1 ' " ' - -thiadiazole, 1,2,4- ohiadiazole , 1,2, 5 - thiadiazole . 
1,3, 4 -ohiadiazole, 1,2,3 , 4 - t hi at riazoie , 1,2,3,5- 
rhiarriazoie, and tetrazole, optionally substituted at 
one cr more positions with alkyi, alkoxy, aryl, aryioxy, 
a_>:ary^, a^karyloxy, halogen, trihalomethyl , 3 (OR, 
SCOTRR', SO.R, SR, N0 2 , NRR ' , OH, CX , C(0;R, OC:0;R 
NKC-;0)R, (CH 2 ).C0 2 R or COXRR * . 

The term "pharmaceut ically aooeptable salt" refers 
to those salts which retain the biological activity and 
properties of the free bases. Pharmaceut ically 
acceptable salts can be obtained oy reaction wi _o 
inorganic acids such as hydrochloric acid, hydrobromic 
acid; sulfuric acid, nitric acid, phosphoric acid, 
metnanesuif onic acid, e thanesul f onic acid, p- 
roluenesulfonic acid, salicylic acid and the like. 

The tern -prodrug- refers to an agent that 13 
converted into the parent drug m vivo. Prodrugs may be 
easier to administer than the parent drug in some 
situations. For example, the prodrug may be 
bicavailable by oral administration but the parent 13 
not, or one prodrug may improve solubility to allow for 
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intravenous adn:nis:rac:cr. . 

"Alkyl" refers to a s:raigh" -chain, branchea or 
cyclic saturated aliphatic hydrocarbon. Preferably, the 
alkyl group has 1 to 12 carbons. More preferably, it is 
5 a lower alkyl of from 1 to 7 carbons, more preferably l 
to 4 carbons. Typical alkyl groups include methyl, 
ethyl, propyl, isopropyl, butyl, isobutyi, tertiary 
butyl, pentyl, hexyl and the like. The alkyl group may 
be optionally substituted with one or more suhstituents 
1C are selected from the group consisting of hydroxyl, 

cyano, alkoxy, =C, = S, N0 2 , halogen, N{CH 3 ) 2 amino, and 
SH. 

"Alkenyl" refers to a straight - chain , branched or 
cyclic unsaturated hydrocarbon group containing at least 
15 one carbon-carbon double bond. Preferably, the alkenyl 
group has 2 to 12 carbons. More preferably it is a 

lower aiKenyi ut ^r^m ^ > ^a^^ iib ' - — t J 

to 4 carbons. The alkenyl group may be optionally 
substituted with one or more substituents selected from 
2 0 the group consisting of hydroxy 1, cyano, alkoxy, =0, = S, 
N0 2/ halogen, N(CH 3 ) 2 amino, and SH . 

"Alkynyl" refers to a straight-chain, branched or 
cyclic unsaturated hydrocarbon containing at least one 
carbon-carbon triple bond. Preferably, the alkynyl 
25 group has 2 to 12 carbons. More preferably it is a lower 
alkynyl of from 2 to 7 carbons, more preferably 2 to 4 
carbons. The alkynyl group may be optionally 
substituted with one or more substituents selected from 
the group consisting of hydroxy 1 , cyano, alkoxy, =C, =S, 
3 0 N0 2 , halogen, N ( CK 3 ; 2 amino, ana SH . 

"Alkoxy" refers to an "O-alkyl" group. 
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anc includes :a; 
biarvl arcucs . 



"DCCVCl i: 



Heterocyclic aryl an 
cup cay be cccicnaliv 



1 — ^ 01 cere suosticuenc 

:ns group consisting of halogen, trihaiomethyl , 
hydroxy!, SH, OH, KC 2 , acme, thioether, cyano, alkoxy, 
alkyl , and ammo . 

"Al-aryl" refers to an alkyl that is covalently 
joined to an aryl group. Preferably, the alkyl is a 
1 owe r a 1 ky 1 . 

"Carbocyclic aryl" refers to an aryl group wherein 
the ring atoms are carbon . 

"Heterocyclic aryl" refers to an aryl group having 
from i to 3 hetsroatoms as ring atoms, the remainder of 
the ring atoms being carbon. Heteroatoms include 
oxygen, sulfur, and nitrogen. Thus, heterocyclic aryl 
groups include f uranyl , thienyi , pyridyi, pyrroivl , N- 
lower alkyl pyrrole, pyrimidyl , pyrazinyl, imidazciyl 
and the like. 

"Amide" refers to -c(Oj-NH-R, where R is alkyl, 
aryl, alkylaryl or hydrogen. 

"Thioamide" refers to - C ( 5 ) -NH--R , where R is alkyl, 
aryl, alkylaryl or hydrogen. 

"Amine" refers to a -N(R')R M group, where R 1 and 
are independently selected from tne group consisting 
of alkyl, aryl, and alkylaryl. 



wnere k is alkyl, arvi 



"Thioether" refers to -S-' 
or alkylaryl . 

"Sulfonyl" refers to -3fO) 2 -R, where R is aryl, 
C ; CN; =C-aryl , CH 2 CN, alkyaryi , sulfonamide, NH- alkyl , NH • 
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alkylaryl, :r NH-aryl. 

The term "acyl" denotes groups -C(0;R, where R is 
aikyl as defined above, such as f ornyl , acetyl , 
propionyl, or butyryl . 
5 It is understood by those skilled in the art that 

when A : , A 2 , A 3 , and A 4 are nitrogen or sulfur that the 
corresponding R 4 , R 5 , R 6 , and R 7 , as well as the 
corresponding bond, do not exist. 

Examples of indoles having such fused rings (as 
10 described in (e) (ii) above include the following: 




15 The six membered rings shown above exemplify 

possible A rings in compound 11. 
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metr.y_pyrrc_ - 5 -yl ; methylene; - 2 - indolincne as well as i- 
.'4- • 4 - rcrmylpicerazine- 1 - yl - benzyl idenylj - 2 - indcl ir.one . 

polypeptide or onese crystalline ferns can oe FGFR, 
and specifically, FGFR1 . 

In preferred embodiments, the crystalline forms of 
tne invention can be defined by the structural 
coordinates set forth m Table 3 or Table 4. 

-ne use or X-ray crystallography can elucidate tne 
three dimensional structure of crystalline forms of the 
invention. The first characterization of crystalline 
rorms by X-ray crystallography can determine the unit 
ceil shape and its orientation in the crystal. 

In other preferred embodiments , the invention 
features a crystal of an FGF receptor tyrosine kinase 
domain protein, where the crystal is characterized by 
having nonoclinic unit ceils. The crystal may also oe 
characterized oy having space group symmetry 32. 

The term "unit cell" refers to the smallest and 
simplest volume element (i.e., parailelpiped- shaped 
block) of a crystal that is completely representative of 
the unit of pattern of the crystal. The dimensions of 
tne unit cell are defined by six numbers: dimensions a, 
b and o and angles or, B and y . A crystal can be viewed 
as an efficiently packed array of multiple unit cei_s. 
Detailed descriptions of crystallographic terms are 
described m, which is hereby incorporated herein by 
reference m its entirety, including any drawings, 
figures, and tables. 

The term "monocimio unit cell" refers to a unit 
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cell where a * b - c ; a = v = 9 0"; ana p > 50"". 

The term "space group" refers to the symmetry of a 
ur.:: cell. In a space group designation (e.g., C2 ) the 
capital letter indicates the lattice type and the other 
5 symbols represent symmetry operations that can oe 
carried out on the unit cell without changing its 
appearance . 

The term "lattice" in reference to crystal 
structures refers to the array of points defined by the 
10 vertices of packed unit cells. 

The term "symmetry operations" refers to 
geometrically defined ways of exchanging equivalent 
Darts of a unit cell, or exchanging equivalent molecules 
between two different unit cells. Examples of symmetry 
15 operations are screw axes, centers of inversion, and 
mirror planes . 

In a preferred embodiment, the invention features a 
crystalline form, where the monociinic unit ceils have 
dimensions of about a=208,3 A, b=57.8 A, c=65.5 A and 
20 [5 = 107. 2°. 

In a preferred embodiment, the invention features a 
FGFR1 crystal, where the monociinic unit ceils have 
dimensions of about a=211.6 A, b=51.3 A, c=66.1 A and 



p — 1 'J / . / 



25 In another aspect the invention teatures a 

polypeptide corresponding to the catalytic domain or a 
protein tyrosine kinase, containing at least about 20 
amino acid residues upstream of the first glycine in the 
conserved glycine-rich region of the catalytic domain, 

30 and at least about 17 amino acid residues downstream of 
the conserved arginine located at the C-terminal 
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_ ^ - . ; , ^ cr _ n e invention can f ' 

".car are isolated, o r _ r - - ^ o ^ 



riy iso.acec" in reference to a coivDeccid* 



r puntiea. 

a polypeptide is 
meant a polymer of 6 , 12, 13 or more ammo acids 
conjugated to each other, including polypeptides that 
are isolated from a natural source or that are 
synthesized. The isolated polypeptides of the present 
invention are unique m the sense that they are not 
rouna m a pure or separated state m nature. Use of 
the term "isolated" indicates that a naturally occurring 
sequence has been removed from its normal cellular 
15 environment. Thus, the sequence may be m a cell -free 

solution or placed in a different cellular environment. 
The term does not imply that the sequence is the only 
ammo acid chain present, but that it is essentially 
free (about 90 - 95% pure at least) of material 
20 naturally associated with it. 

By the use of the term "enriched" m reference to a 
polypeptide it is meant that the specific amino acid 
sequence constitutes a significantly higher fraction (2 

3 roid; cf the total of ammo acids present in the 
ce - s zr solution of interest than m normal or diseased 
c e — s cr i- the cells from which the sequence was taken. 
This could be caused by a person by preferential 
reduction m the amount of other ammo acids present, or 
by a preferential increase m the amount of the specific 
ammo acid sequence cf interest, cr by a combination of 
the two. However, it should be noted that "enriched" 
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does not inply :ha: there are no ether arm no acid 
sequences present, just that tne relative axoun; of t he 
sequence cf interest has been s ignif leant ly increased . 
The term significant here is used to indicate that the 
5 level of increase is useful to the person nakmg such an 
increase, and generally means an increase relative to 
other amino acids of about at least 2 fold, more 
pre f erably at least 5 to 10 fold cr even more . The term 
also does not imply that there are no amino acids from 

10 other sources. The other source amino acids may, for 

example, comprise amino acids encoded by a yeast or 
bacterial genome, or a cloning vector such as pUC19. 
The term is meant to cover only those situat ions in 
which a person has intervened to elevate the proport ion 

15 of the desired nucleic acid . 

It is also advantageous for some purposes that an 
amino acid sequence be in puri f ied form . The term 
" purified" in reference to a polypeptide does not 
require absolute purity ( such as a homogeneous 

2 0 preparation) ; instead, it represents an indication that 
the sequence is relatively purer than in the natural 
environment ( compared to the natural level this level 
should be at least 2-5 fold greater, e.g., in terms of 

/ n \ r^..^,'-c^™-,4--: _ ~p « «_ a « — <c 

1 1 L / III! ,' . rui ill ^ CI L- ± W l i 'sJ J_ CX '_- J-GCtb L 'w l iiC 'w* JL CT 1. W _ 

25 magnitude, preferably two or three orders, and more 

preferably four or five orders of magnitude is expressly 
contemplated. The substance is preferably free of 
contamination at a functionally significant level, for 
example 90%, 95%, or 99% pure. 

30 In a preferred embodiment, the invention features a 

polypept ide corresponding to the catalytic domain of a 



BNSOOCID: <WO »607e35A2JA> 



WO 98/07835 



PCT/US97 14885 



20 



25 



• - — - 1-' - - ~ ~ * - - n e r£C6cic v ~~ 

. n e insulin rsceptcr, ever. t h c u c i 
■ S\ r be different. 



- n a preferred embc d i mer.t the m 7 
polypeptide ccrrespcnairg to :he catalytic domain of a 
non-receptor PTK , where one non- insulin receoto- 
tyrosine kinase is a cytoplasmic tyrosine kinase. 

In a preferred embodiment, the invention features a 
polypeptide corresponding to the catalytic domain of a 
receptor PTK , selected from one group consisting of FGF - 
R, PDGF-R, KDR , CCK4 , MET , TRrlA, AXL , TIE , EPK, RYK , 
DZ'R, ROS, RET , LTK , ROR1 , or MUSK. 

In a preferred embodiment, the invention features a 
polypeptide corresponding to the catalytic domain of a 
non-receptor PTK , selected from the group consisting of 
SRC, BRK, BTK, CSK, ABL, ZAP7C, FES, FAK , JAK , or ACK . 

In a preferred embodiment, the invention features a 
polypeptide corresponding to the catalytic domain of a 
?TK, having the amino acid sequence shown in Table 1 or 
Table 2 . 

In another aspect, the invention features a method 
for creating crystalline forms described herein. The 
method may utilize the polypeptides described herein to 
term, a crystal. The method comprises tne steps of; 

(a) mixing a volume of polypeptide solution 
with a reservoir solution; and 

(b) incubating the mixture obtained m steo 
(a) over the reservoir solution in a closed container, 
under conditions suitable for crystallization. 

These processes are described m detail in the 
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section e "idea "Detailed Description of the 
Invention . " 

In anotner aspect, the invention features a met nod 
of obtaining ?G? receptor tyrosine kinase domain 
5 polypeptide in crystalline form, comprising the steps 

of: (a) mixing a volume of polypeptide solution with an 
equal volume of reservoir solution, where the 
polypeptide solution comprises 1 mg/mL to 60 mg/m.L FGF- 
type tyrosine kinase domain protein, 10 mM to 200 mM 

10 buffering agent, 0 mM to 20 mM di thiothrei tol and has a 
pH of about 5.5 to about 7.5, and where the reservoir 
solution comprises 10% to 30% (w/v) polyethylene glycol, 
0.1 M to 0.5 M ammonium sulfate, 0% to 20% (w/v) 
ethylene glycol or glycerol, 10 mM to 200 mM buffering 

15 agent and has a pH of about 5.5 to about 7.5/ and (b) 
incubating the mixture obtained in step (a) over said 
reservoir solution in a closed container at a 
temperature between 0° and 25°C until crystals form. 

In a preferred embodiment, the invention features a 

2 *j met nod of od taming FGF receptor tyrosine kinase domain 
polypeptide in crystalline form, where the polypeptide 
solution comprises about 10 mg/mL FGF receptor tyrosine 
kinase domain, about 10 mM sodium chloride, about 2 mM 

dltViir^ViT-^il-r^l ahnnt- in mK/T Tv-i ^ ur*7 -s >-> ~ ~, ^ t t ~ ^ 
- ~ — *■ ~ ■ — - — — < — > - ^ ^ ^ mr, a ^j_o ii' — l. ctj.i'^ ^ici~3 a ^ n i_ 

25 about 6; the reservoir cutter comprises about 16% (w/v) 
polyethylene glycol (MW I00CC), about 0.3 M ammonium 
sulfate, about 5% ethylene glycol or glycerol, about 100 
mM bis-Tris and has a pH of about 6.5; and the 
temperature is about 4°C. 

30 In another preferred embodiment, the invention 

features a method of obtaining FGF receptor tyrosine 
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' ;,,aj — ^ ^ - : P ~ ? - - 3, 6: n crysta . ir.? i o r rr. , wri^r? t h c - 

.ae solution includes a :cr,ccu.nd sue- as a 
subs:ra:e, substrate analog, :nhici:cr cr 

-n still another preferred embodiment , the 
invention features a method of obtaining ?3F receptor 
tyrosine kinase domain polypeptide m crystalline form, 
where the compound is a nucleotide analog, such as a 
nc-n-hydroiyzable analog of AT?, or an mdolmone. 
Indolmone compounds have the general structural formula 
as described herein. 

In another aspect, the invention features a cDNA 
encoding an FGF receptor tyrosine kinase domain protein, 
where a coding strand of the cDNA has tne nucleotide 
sequence of SSQ ID NO : 5 . 

Another aspect of the invention relates to a method 
of determining three dimensional structures of PTKs with 
unknown structure by utilizing the structural 
coordinates of Table l, Table 2, Table 3, and Table 4. 
These methods can relate to homology modeling, molecular 
replacement, and nuclear magnetic resonance methods. 

In a preferred embodiment, the invention relates to 
a method of determining three dimensional structures of 
PTKs with unknown structures by utilizing the 
coordinates of Table 1, Table 2, Table 3<, or Table 4 m 
conjunction with the amino acid sequences of PTKs. This 
method of homology modeling comprises the steps of: (a) 
aligning the computer representation of an amine acid 
sequence of a PTK with unknown structure with that of a 
PTK with known structure, where alignment is achieved by 
matching homologous regions of the ammo acid seauences; 
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io) transferring tne computer representation of an ax.inc 
acid structure in the PTK sequence of Known structure to 
a computer representation of a structure of the 
corresponding amino acid in the PTK seauence with 
5 unknown structure; and (c) determining low energy 
conformations of the result ing PTK structure . 

The term "amino acid sequence" describes the order 
of amino acids in the amino acid chain comprising a 
polypeptide corresponding to the catalytic domain of a 
10 PTK. 

The term "aligning" describes matching the 
beginning and the end of two or more amino acid 
sequences. Homologous am_no acid sequences are placed 
on top of one another during the alignment process. 
15 The term "homologous" describes amino acids' in two 

sequences that are identical or have similar side-chain 
chemical groups (e.g. , aliphatic, aromatic, polar, 
negatively charged, or positively charged) . 

The term "corresponding" refers to an amino acid 
20 that is aligned with another in the sequence alignment 
mentioned above . 

The term "determining the low energy conformation" 
describes a process of changing the conformation of the 
PTK structure such that the structure is of lev; free 
energy. T^e PTK struct are may or may not nave molecules, 
such as modulators bound to it. 

The term "low free energy" describes a state where 
the molecules are in a stable state as measured by the 
process. A stable state is achieved when favorable 
30 interactions are formed within the complex. 

The term "favorable interactions" refers to 
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relates :c a T.echcd of determining tnree dimensional 
structures cf ?TKs with unknown structure. This method 
is accomplished oy applying tne structural coordinates 
cf Table 1, Table 2, Taoie 3, or Table 4 to an 
incomplete X-ray crystaliograpnic data set for a ?T?h 
The method comprises tne steps of: ( a ^ aiicnma the 
positions of atoms in tne unit -ell oy matching electron 
diffraction data from two crystals, where one data set 
is complete and the other is incomplete; and (b) 
determining a low energy conformation of the resultiric 
PTK structure . 

The term "incomplete data set" relates to a X-ray 
crystallographic data set that does not have enough 
information to give rise to a three dimensional 
structure . 

In another preferred embodiment, the invention 
relates to a method of determining three dimensional 
structures of PTKs with unknown structure by applyina 
the structural coordinates of Table 1, Table 2, Table 3, 
or Table 4 to nuclear magnetic resonance (NMR) data cf a 
?TK . This method comprises the steps of: (a) 
25 determining the secondary structure of a PTK structure 
using NKR data; and (b) simplifying the assignment of 
through-space interactions cf amino acids. The PTK 
structure may not be complexed with compounds or 
modulators . 

30 term "secondary structure" describes the 

arrangement of ammo acids m a three dimensional 
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structure, such as in a-helix cr 3-sheet: elements. 

The tern "through- space interactions 7 ' defines the 
orientation of the secondary structural elements m the 
three dimensional structure and the distances between 
5 amino acids from different portions of the amino acid 
sequence . 

The term "assignment" defines a method of analyzing 
NMR data and identifying which ammo acids give rise to 
signals in the NMR spectrum. 

10 In another aspect, the invention features a method 

of identifying potential modulators of PTK function. 
These modulators are identified by docking a computer 
representation of a structure of a compound with a 
computer representation of a cavity formed by the 

15 active-site of a PTK. The computer representation of 
the PTK active-site structure can be defined by 
structural coordinates . 

The term "chemical group" refers to moieties that 
can form hydrogen bonds , hydrophobic, aromatic, or ionic 

20 interactions. 

The term "docking" refers to a process of placing a 
compound in close proximity with a PTK. The term can 
also refer to a process of finding low energy 
conformations of the compound / P T K complex. 

25 A preferred embodiment of the invention is a method 

of identifying potential modulators of PTK function. 
The method involves utilizing the structural coordinates 
or a PTK three dimensional structure. The structural 
coordinates set forth m Table 1, Table 2, Table 3, and 

30 Table 4 can be utilized. The method comprises the steps 
of: (a) removing a computer representation of a PTK 
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riming a :cn:cr^a:;on of the complex with a 
favorable geometric fit: and favcrable complementary 
:r.:erac:icns; and c identifying ccnpcunas tnat best 
nt tne PTK active-site as potential modulators of PTK 
function. The initial PTK structure nay or nay not have 
compounds bound to it . 

Tne term "favorable geometric fit" refers to a 
conformation of tne compound- PTK complex where tne 
surface area of the compound is m close proximity with 
the surface area of the active-site without forming 
unfavorable interactions. Unfavorable interactions can 
be sterio hindrances between atoms in the compound and 
atoms m the PTK active-site 

The term "favorable complementary interactions" 
relates to hydrophobic, aromatic, ionic, and hydrogen 
bond donating, and hydrogen bond accepting forces formed 
between the compound and the PTK active-site. 

The term "potential" qualifies the term "modulator 
of PTK function" because the potential modulator or PTK 
function has not yet been tested for activity in vitro 
or in vivo. 

The term "best fit" describes compounds that 
compiexed the most surface area in the complex and/or 
form the most favorable complementary interactions with 
tne PTK m the screen in a criven experiment 

Another preferred embodiment of the invention is a 
method of identifying potential modulators of PTK 
function. The method involves utilizing a three 
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dimensional srruciure or a PTK, wicn or wi:nou: 
compounds bound to it. The method comprises zhe seeps 
cf : (a) modifying a computer representation cf a PTK 
having one or more compounds bound to it, wnere the 
5 computer representations of the compound or compounds 
and PTK are defined by structural coordinates; lb) 
determining a conformation of the complex with a 
favorable geometric fit and favorable complementary 
interactions; and (c) identifying the compounds -that 
10 best fit the PTK active-site as potential modulators of 
PTK function. 

The term "modifying" relates to deleting a chemical 
group or groups or adding a chemical group or groups. 
Computer representations of the chemical groups can be 
15 selected from a computer data base. 

Yet another preferred embodiment of the invention 
is a method of identifying potential modulators of PTK 
function by operating modulator construction or 
modulator searching computer programs on the compounds 
20 complexed with the PTK. The method comprises the steps 
of: (a) removing a computer representation of one or 
more compounds complexed with a PTK; and (b) searching a 
data base for compounds similar to the removed compounds 
using a compound searching computer program, or 
25 replacing portions of the compounds complexed with the 
PTK with similar chemical structures from a data base 
using a compound construction computer program, where 
<-h- representations of the compounds are defined by 
structural coordinates . 
30 T he term "operating" as used herein refers to 

utilizing the three-dimensional conformation of 
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smputer aata case :ha; has a s:~:Iar aecmetric 
- c -- re =5 compounds tr.at can bind tc a PTK. T'^e 

compound can also have similar chemical crroups 
as -.r.e co-pounds char are either bound to the ?TK cr 
once bound tc the ?TK . The similar chemical groups can 
term complementary interactions with the PTK. 

The term "compound searching computer program" 
describes a computer program that searches computer 
representations of compounds from a computer data base 
that nave similar three dimensional structures and 
similar chemical groups as a compound of interest. The 
compound cr interest is preferably an mdolinone 
compound . 

The term "similar chemical structures" refers to 
chemical groups that share similar geometry as oortions 
of the compounds m complex with the PTK or compounds 
removed from the PTK structure. Similar chemical 
structures can also refer to chemical groups that may 
form similar complementary interactions as portions of 
the compounds in complex with the PTK or compounds 
removed from the PTK structure. 

The term "replacing structures" refers to removing 
a portion cf the compounds in complex with the PTK cr 
compounds removed from the PTK structure and connecting 
tne broken bonds to a similar chemical structure. 

The term "compound construction computer program" 
describes a computer program that replaces computer 
representations of chemical groups in a compound with 
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groups from a computer data base. The ::^ound is 
preferably an mdolmone compound. 

The term "similar chree dimensional structure" 
describes two molecules with nearly identical shape and 
5 volume . 

In another preferred embodiment of the invention, 
the ?TK structures used in the modulator design or 
identification method of the invention are defined by 
the structural coordinates of Table 1 , Table 2 , Table 3 , 

10 or Table 4 . 

The methods for using the crystalline forms and 
three dimensional structures of the invention can relate 
to a broad range of protein kinases. Thus, in preferred 
embodiments, the invention relates to a receptor PTK . 

15 The receptor PTK can be selected form the group 

consisting of FGF-R, PDGF-R, FLK, CCK4 , MET, TRKA, AXL, 
TIE, EPH , RYK , DDR , RQS , RET, LTK , ROR1 , and MUSK. The 
PTK may also exist as a non-receptor PTK. The non- 
receptor PTK can be selected from the group consisting 

20 of SRC, BRK, BTK , CSK , ABL, ZAP70, FES, FAK , JAK, and 

ACK. 

In another aspect , the invention features a 
potential modulator of PTK function identified by 

1 " w_ i * ^ O ^ ^ O ^ _ V V C> * V X1J W 1 ± ^ V . - ' L^ii . 

^ A preterred embodiment of the invention is that the 

potential modulator of PTK function is an oxindolinone 
or a thiolindolinone of formula I or II disclosed above. 

Another aspect of the invention is a method for 
synthesizing a potential modulator of PTK function or 

3 0 its pharmaceut real ly acceptable salts , isomers , 

metabolites, esters, amides, or prodrugs oy a standard 
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- -"- sr.ctner aspect: , the ir.ver.ticr. feature" a 
cf identifying a potential modulator of ?TK function as 
a -cauxarcr ?TK function. The method cc-prises the 
seeps of: ;a; administering a potential modulator of PTK 
function to cells; (b; comparing the level of PTK 
phosphorylation between cells not administered the 
potential modulator and cells administered the potentia" 1 
moauiatcr; and ! c< identifying the potential modulator 
as a modulator of PTK function based on the difference 
in the level of PTK phosphorylation. 

The term "cells" refers to any type of cells either 
primary or cultured. Primary cells can be extracted 
directly from an organism while cultured cells rapidly 
divide and can oe cultured in many successive rounds. 
Cells can be grown m a variety of containers including, 
but not limited to flasks, dishes, and well plates. 

The term "administer" refers to a method of 
delivering a compound to ceils. The compound can be 
prepared using a tarrier such as dimethyl sulfoxide 
(DMSO) in an aqueous solution. The aqueous solution 
comprising the compound, also termed an "aqueous 
preparation", can be simply mixed into the medium 
bathing the layer of cells or mi crom j ected into the 
cells themselves. The compounds may be administered to 
the cells using a suitable buffered solution. 

The term "suitable buffered solution" refers to an 
aqueous preparation of the compound that comprises a 
salt that can control the pH of the solution at low 
concentrations. Because the salt exists at low 



20 
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con cent: rations , the sal:, preferablv aoes not alter the 
function of the cells. 

The term VV ?TK phosphorylation" refers to the 
presence of phosphate on the ?TK . Phosphates on ?TKs 
£ can be identified by antibodies tnat bind them 

specifically with high affinity. 

In another aspect, the invention features a method 
of identifying a potential modulator of ?TK function as 
a modulator of ?TK function. The method comprises the 

10 steps of: (a) administering a potential modulator cf PTK 

function to cells; (b) comparing the level of ceil 
growth between cells not administered the potential 
modulator and cells administered the potential 
modulator; and (c) identifying the potential modulator 

15 as a modulator of PTK function based on the difference 

in cell growth. 

The term "cell growth" refers to the rate at which 
a group of cells divides. Cell division rates can be 
readily measured by methods utilized by those skilled in 

20 the art. 

Another aspect of the invention features a method 
of diagnosing a disease by identifying cells harboring a 
PTK with inappropriate activity. The method comprises 
the steps of: (a) administering a modulator of PTK 

25 r unco ion to cel^s, (b) comparing the rate ot cell growth 

between cells not administered the modulator and ceUs 
administered the modulator; and (c) diagnosing a disease 
oy characterizing cells harboring a PTK with 
inappropriate activity from the effect of the modulator 

30 on the difference in the rate of cell growth. The 

modulator can be identified by the methods of the 
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PTK by a growth r actor , alterations in the activity of 
r ./.-specinc phosphatase , over - excress i on of a PTK ~ n 
cel -' cr mutations ir. the catalytic region of tne PTK 
itself . 



The term "signal transduction process" describes 
tne steps in a cascade of events where an extracellular 
signal is transmitted into an intracellular signal. 

The tern U ?TK- specif ic phosphatase" describes an 

15 enzyme that dephosphorylat es a particular PTK and 

thereby regulates that PTK ' s activity. 

Another aspect of the invention is a method of 
treating a disease associated with a PTK with 
inappropriate activity m a cellular organism, where the 

20 method comprises the steps of: (a) administering the 

modulator of PTK function to the organism, where the 
modulator is in an acceptable pharmaceutical 
preparation; and (b) activating or inhibitma the PTK 
function to treat the disease. 

25 The term "organism" relates to any living being 

comprised of at least one cell. An organism can be as 
simple as one eukaryotic cell or as complex as a mam.mal . 

The term "administering", m reference to an 
organism, refers to a method of introducing the compound 
to the organism. The compound can be administered when 
the cells or tissues of the organism exist within the 
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organism cr ou:side of zhe organise. Cells ex:s:ma 
our side the organisr. can be maintained or arown m cell 
culture dishes. For cells harbored within the crganisn, 
many techniques exist m the art to administer 
compounds, including (but not limited to) oral, 
parenteral, dermal, and injection applications. For 
cells outside of the patient, multiple techniques exist 
in the art to administer the compounds, including (but 
not limited to) cell microinjection techniques, 
transformation techniques, and carrier techniques. 

The term "pharmaceut icai ly acceptable composition" 
refers to a preparation comprising the modulator of PTK 
activity. The composition is acceptable if it does not 
appreciably cause irritations to the organism 
15 administered the compound. 

Preferred embodiments of the of the invention are 
that the PTK is a receptor PTK selected from the qrout 
consisting of FGF-R, PDGF-R, FLK-i , CCK4 , MET , TRKA, 
AXL, TIE, EPH, RYK , DDR, ROS , RET, LTK , R0R1 , and MUSK. 
20 Other preferred embodiments of the invention are that 
the PTK is a non-receptor PTK selected from the group 
consisting of SRC , BRK, BTK, CSK, ABL, ZAP70, FES, FAK, 
JAK, and ACK . 

The summary of the invention described above ^ ^ 



30 



iiuii-ixuiiLiiiy etna uLiier teacures ana advantages ot the 
invention will be apparent from the following detailed 
description, and from the claims. 

BRIEF DESCRIPTION OF THE FIGURES 
FIG. 1 provides a ribbon diagram of the structure 
of FGFR1 showing the side chains of tyrosines Tyr-653 
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k =*- a-, jt^r , Ofr -Oil 

2 strand 21-25, 2"", 28 , r.uclect::de-i::nd;^ moo 
catalytic loop, ac:r;a:icr. mop and -nase insert 



;l O x e CU ^ ' 



.'ne termini are denoted 



between 22 and 23 is disordered, 
indicated by a break in tne chain in this region. 

FIG. 2 provides a stereo view of a e„ trace of FGFRI 
snown in tne same orientation as FIG . 1, witn every 
tenth amino acid residue narked with a filled circle and 
10 every twentieth ammo acid residue labeled with a 

residue number. 

FIG. 3 provides a structure-based sequence 
alignment of human fibroblast growth factor receptor 1 
{ FGFR1 ) , human fibroblast growth factor receptor 2 
J - 5 - — -2/, numan ribroblasc growth factor receptor 3 

( FGFR3 ) , human fibroblast growth factor receotor 4 
( FGFR4 ) , a D . malanogaster homolog { DFGFR1 ) a C. 
elegans homolog (EGL-15) and insulin receptor tyrosine 
kinase ( IRK) . 

20 FIGS. 4A and 4B provide ribbon diagrams of the 

N-terminal lobes (4A) and C-terminal lobes (4B) of FGFRi 
and IRK in which the C a atoms of the (J sheets (4A) or oc- 
heiices (4B) of the two proteins have been superimposed. 
FIG. 5 illustrates the side-chain positions of the 

25 tyrosine autophosphorylat ion sites of FGFRI on the 
backbone representation of FGFRI . 

FIGS. 6A and 6B are ammo acid sequence alignments 
of the catalytic domains of ?TKs , including receptor and 
non-receptor type PTKs . FIG . 5A depicts one 

3 0 representative member from each of the eiahteen 

subfamilies of receptor tyrosine kinases. FIG. SB 
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aepicts one represent: at ive T.e^rer from each cf the 
subfamilies of cytoplasmic tyrosine kinases. In FIGS. 
6A and 63 highly conserved residues are boxed. The 
position of the glycine-rich domain , kinase insert, 
catalytic loop, and activation loop are indicated. Th* 
numbering is for human FGF-receptor . 



BRIEF DESCRIPTION OF THE CPY^TALlOHP APH I C ATOf^ jr 

STRUCTURAL COORDINATES 
The crystallographic structural coordinates are 
located at the end of the section entitled "Examples" 
and before the claims. Three sets of coordinates can be 
found in the Protein Data Bank under accession names 
1FGK, 1AGW, and 1FGI . The 1FGK coordinates correspond 
15 to those listed in Table 1, the 1AGW coordinates 

correspond to those listed in Table 4, and the 1FGI 
coodinates correspond to those listed in Table 3 . The 
IAGW and 1FGI coordinate sets will be publicaiiy 
available in March 1998. 

- — — - ^^wv^^.es Luc aiutiuu bLTuct ur e coordinate s 
of native FGFR1 crystals of the invention as determined 
by X-ray crystallography; and 

Table 2 provides the atomic structure coordinates 

Of FGFR 1 ! AMP- PTP rn-rrvcfa 1 c r>-F ^ V,^ -; +_ ■ _ 

determined by X-ray crystallography. 

Table 3 lists cry s tail ogr aph i c coordinates defining 
the three dimensional structure of FGF-R1 complexed with 
3 - [ ( 3 - ( 2 -carbcxyethyl ) -4 -methylpyrrol - 5 -yl J methylene] - 2 - 
indolinone. The columns (from left to right; are 
descriptions cf the atoms by number and type, amino acid 
and number containing the atom, the x coordinate, y 
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:ese parameters are 



rrar 



; c r c i r. aces 



:nree dimensional structure of FGF-R: 
:omplexed with 3 - ; 4 - [ 4 - f orny Ip ipera z me - 1 -y 1 ) 



LO-inone . The columns are as 



iescribed :r. Table 3 . 



15 



20 



DETAILED D ESCRIPTION OF THE I WF.XT TOM 
The present invention is directed to the design and 
identification of modulators of protein tyrosine kinase 
function that are ?TK subfamily specific, non- 
hydrolyzable under acidic conditions, and highly 
oioavailable . The three dimensional structures of a ?TK 
optionally complexed with compounds can facilitate 
design and identification of modulators of ?TK function. 

Protein tyrosine kinases (?TKs) comprise a large 
and diverse class of enzymes. Schlessmger and Ullrich, 
1992, Neuron 9: 383-391. The FTK family is subdivided 
into members that are receptors and those that are non- 
receptors. The PTK receptor family contains multiple 
subfamilies, one of which is the fibroblast growth 
factor receptor (FGF-R) PTK which is a molecule 
25 implicated m regulating angicgenesis a well as cellular 

proliferation and differentiation. Givol and Yayon, 
1992, FASEB J. 6 (25): 3362-3369. 

FGE-R1 can mediates cellular functions by its role 
in one or more cellular signal transduction processes. 
jC Ce^^u^ar signal transduction processes comprise multiole 

steps that convert an extracellular signal into an 



BNSDOCID <WO 9907836*2 J A> 



WO 98/07835 



PCT/US97/14885 



intracellular signal . 

Receptor ?TK mediated signal transduction 15 
initiated by binding a specific extracellular ligand, 
followed by receptor dimeri zat ion , and subsequent 
5 autophosphorylation of the receptor PTK. The pnoschate 
groups are binding sites for intracellular signal 
transduction molecules which leads to the formation of 
protein complexes at the cell membrane. These complexes 
facilitate an appropriate cellular effect (e.g., cell 

10 division, metabolic effects to the extracellular 

microenvironment ) in response to the iigand that began 
the cascade of events . 

Receptor PTKs function as binding sites for several 
intracellular proteins Intracellular PTK binding 

15 proteins are divided into two principal groups: (1) 
those which harbor a catalytic domain; and (2) those 
which lack such a domain but serve as adapters and 
associate with cat alytically active molecules. Songyang 
et al - , 1993, Cell 72:767-778. SH2 ( src homology; 

2Q domains are common adaptors found in proteins which 
directly bind to the receptor PTK. SH2 domains are 
harbored by PTK binding proteins of both groups 
mentioned above. Fantl et al . , 1992, Cell 63:413-423; 
Songyang et al . . 19 94, Mol . Cell. Biol, 24:2 77 7-2785); 
Songyang et al . , 1993, Cell /2:7b/ -778; and Koch et a 2 . , 
19 91, Science 2 5 2 : 6 6 6 - 6 7 8 . 

The specificity of the interactions between 
receptor PTKs and the SH2 domains of their binding 
proteins is determined by the amino acid residues 

30 immediately surrounding the phosphorylat ed tyrosine 

residue. Differences in the binding affinities of 5H2 
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~- ci- . , Ce^_ iT • • " 7 " 7 6 . .hese observations 

~ ^ — — - ---- - on c_ eacr. recec: or PTK is 

u.xi . ..^ its pa. .err. oz expression ana 

-igana ava;.ac:.:ty but also by one array of downstream 
signal transduction pathways that are activated bv a 
particular receptor. Thus, PTRs provide a controlling 
regulatory rc.e m signal transduction processes as a 
consequence of aut cpn osphorylat ion . 

PTK-mediated signal transduction regulates cell 
proliferative, differentiation, and metabolic responses 
in cells. Therefore, inappropriate PTK activity can 
result m a wide array of disorders and diseases. These 
disorders, wnich are described below, nay oe treated bv 
the modulators of PTK function designed or identified by 
tne methods disclosed herein. 

The present invention also relates to crystalline 
polypeptides corresponding to the catalytic domain of 
receptor tyrosine kinases. Such tyrosine kinases 
include receptors of a class that are not covalently 
cross- linked but are understood to undergo ligand- 
ihduced dimerization, as well as cytoplasmic tyrosine 
kinases. Preferably, tne crystalline catalytic domains 
are of sufficient quality to allow for the determination 
of a three-dimensional X-ray diffraction structure to a 
resolution of about 1.5 A to about 2.5 A . The invention 
also relates to methods for preparing and crystallizing 
tne polypeptides. The polypeptides themselves, as well 
as information derived from their cry; 



:ai structures can 
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ce used to analyze and modify tyrosine kinase a — 
as well as to identify compounds that interact with tr.e 
catalytic domain. 

The polypeptides of the invention are designed on 
the basis of the structure of a region m the 
cytoplasmic domain of the receptor tyrosine kinase that 
contains the catalytic domain. 3y way of illustration, 
FIG. 6A shows the ammo acid sequence alignment of the 
catalytic domains of eighteen human receptor tyrosine 
kinases; one representative member from each of the 
eighteen subfamilies is shown. FIG. 6B shows the 
alignment for cytoplasmic kinases. The applicants have 
discovered and determined the boundaries of the domain 
required for crystallization of the resulting 
polypeptide. Surprisingly, these boundaries differ from 
that required for catalytic activity. For example, 
referring to FIG. 6A, the domain required for catalvr in- 
activity is generally believed to span about 7 amino 
acid residues upstream of the first glycine (FIG. 6A 
20 residue number 485) of the N • terminal glycine-rich 

region through about 10 residues beyond the C-terminal 
conserved arginine (FIG. 6A, residue number 744) . 
However, the additional sequence upstream of the N- 
terminal glycine-rich region and downstream of 
^ o terminal conserved arginint; can be required for 

crystallization. In particular, at least about 20 amino 
acid residues (•+-/- 5 amino acid residues; upstream of 
the first glycine (j ,e, , FIG. 6A, residue number 435; in 
the conserved glycine-rich region of the catalytic 
domain, and at least about 17 amino acid residues (+/- 5 
amino acid residues) downstream of the conserved 
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r :r.eer a n: 



^ n nose s 1 c u a c i o n s where the resulting col ^ 1 ^ , 

contains cysteine residues that interfere with 

crystallization ( e.g. . cysteine residue numbers 488 and 

584 in the FGF-RI sequence shown m FIG. 6A) , such 

cysteine residues can be substituted with an aocronriac 

ax.ino acid that aces net readily form covalent cones 

with other amino acid residues under crystallization 

conditions; e.g. , by substituting the cysteine with Ala, 

Ser or Gly. Any cysteine located in a non-helical or 

ncr.-3-stranded segment, based on secondary structure 

assignments, are good candidates for replacement. For 

example, cysteines located m regions cc rresoondmc co 

tne glycine-rich-loop, the kinase insert, the 

juxtamembrane region or the activation loop are prime 

candidates for replacement. However, substitutions of 

cysteine residues that are conserved among tne kinases 

' e ■ 9 - ' FIG - 6A at positions 725 and 736) are preferably 
avoided . 



1 ■ PTK Associated Diseases 

Blood vessel proliferative disorders refer to 
angiogenic and vascuiogenic disorders generally 
resulting m abnormal proliferation of blood vessels. 
The formation and spreading of blood vessels play 
important roles m a variety of physiological processes 
such as embryonic development, corpus luteum formation, 
wound healing and organ regeneration. They also clay a 
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pivocai ro^e ir. cancer aevelcpT.en: . Other examples cf 
blood vessel proliferation disorders include arthritis, 
where new capillary blood vessels invade the -cir.: and 
destroy cartilage, and ocular diseases, like diabetic 
5 retinopathy, where new capillaries m the retina invade 

the vitreous, bleed and cause blindness. Conversely, 
disorders related to the shrinkage, contraction or 
closing of blood vessels are implicated in such diseases 
as restenosis . 

10 Fibrotic disorders refer to the abnormal formation 

of extracellular matrix. Examples of fibrotic disorders 
include hepatic cirrhosis and mesangial cell 
proliferative disorders. Hepatic cirrhosis is 
characterized by the increase in extracellular matrix 

15 constituents resulting in the formation of a hepatic 

scar. Hepatic cirrhosis can cause diseases such as 
cirrhosis of the liver. An increased extracellular 
matrix resulting in a hepatic scar can also be caused by 
viral infection such as hepatitis. 

20 Mesangial cell proliferative disorders refer to 

disorders brought about by abnormal proliferation of 
mesangial cells. Mesangial proliferative disorders 
include various human renal diseases, such as 
glomerulonephritis , diabetic nephropathy, malignant 

25 nephrosclerosis, thrombotic microangiopathy syndromes, 

transplant rejection, and glomerulopathies. The PDGF-R 
has been implicated in the maintenance of mesangial ceil 
proliferation. Floege et al . , 1993, Kidney 
International 43 : 47S- 545 . 

30 PTKs are directly associated with the cell 

proliferative disorders described above. For example, 
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: one aeve-CDnent or cancer. S c m e of 

lancer 63:227-233; Torp en al . , 1992. A PAT'S 112:713- 
5 ~ l 9 HER 2 7 neu 'Slamon et al . , 2 98 9, Science 244:70—712; 

and PDGF-R (Ku^iabe e: al . , 19 92, Oncogene 7:627-633.. are 
over-expressed ;n many tumors and/or persistently 
activated by autocrine loops. In fact, PTK over- 
expressicr. (Akbasak and Suner - Akbasak et al . 1932, J. 
10 Neurol. Sci . 211:119-133; Dickson et al . , 1992, Cancer 

Treatment Res. 61:249 -273 ,- Korc et al . , 1992, J. Clin. 
Invest. 90 : 1352-1360; and autocrine loop stimulation 
(Lee and Doncghue , 1992, J. Cell. Biol. 118:1057-1070; 
Korc et al . , supra; Akbasak and Suner -Akbasak et al . , 
15 supra) account for the most common and severe cancers. 

For example, EGFR is associated with squamous cell 
carcinoma, astrocytoma, glioblastoma, head and neck 
cancer, lung cancer and bladder cancer. HER2 is 
associated with breast, ovarian, gastric, lung, pancreas 
20 and bladder cancer. PDGF-R is associated with 

glioblastoma, lung, ovarian, melanoma and prostate 
cancer. The receptor PTK c-met is generally associated 
with hepatocarcinogenesis and thus hepatocellular 
carcinoma. Additionally, c-met is linked to malignant 
25 tumor formation. More specifically, c-met has been 

associated with, among other cancers, colorectal, 
thyroid, pancreatic and gastric carcinoma, leukemia and 
lymphoma. Additionally, over-expression cf the c-met 
gene has been detected in oatients with Hodckms 
disease, Burkitts disease, and the lymphoma cell line. 
The IGF- 1 receptor PTK , in addition to beina 
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implicated m nutritional support and m tvoe-II 
diabetes, is also associated with several tvoes of 
cancers. For example, IGF- I has been implicated as an 
aut ocrine growth st lmulator for several tumor tyoes , 
5 e.g. human breast cancer carcinoma cells {Arteaga et 

al . , 1989, J. Clin. Invest. 84:1418 -1423 1 and small lung 
tumor cells (Macauiey et al . , 1990, Cancer Res. 50:2511- 
2517). In addition, IGF- I, integrally involved in the 
normal growth and differentiation of the nervous system, 

10 appears to be an autocrine stimulator of human gliomas. 
Sandberg-Nordqvist et al., 1993, Cancer Res. 53:2475- 
2478. The importance of the IGF- IR and its modulators 
in cell oroli f era t ion is further suooort ed ^ r r-v^ 
that many cell types in culture (fibroblasts, epithelial 

15 cells, smooth muscle cells, T - lymphocytes , myeloid 

cells, chondrocytes, osteoblasts, the stem cells of the 
bone marrow) are stimulated to grow by IGF- I. Goldring 
and Goldring, 1991, Eukaryotic Gene Expression 1:301- 
326. In a series of recent publications suggest that 

20 IGF - IR plays a cent ra 1 role in the mechanisms of 

transformation and, as such, could be a preferred target 
for therapeutic interventions for a broad spectrum of 
human malignancies. Baserga, 1995, Cancer Res. 55:249- 
252; Baserga, 1994, Cell 75:927-930; Coppola et al . , 

2 5 19 94, Moi. Cell. Biol. 14:4586-4595. 

The association between abnormalities in receptor 
PTKs and disease are not restricted to cancer, however. 
For example, receptor PTKs are associated with metabolic 
diseases like psoriasis, diabetes mellitus, wound 

30 healing, inflammation, and neurodegenerative diseases. 

EGF-R is indicated in corneal and dermal wound healing. 
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fa ..i-^^^ WfcAt3 . ^ ...omp^ece ccrre.a:;cn oetweer. 

^ _ ^^^^ ana -r.e.r tnerapeu::: :r.ci;cac:cns 

is sec forth m Plowman al . , 1994, DN&P 7:334-339. 

5 Non-receptor PTKs, including src, abl , fps, yes, 

fyn, lyn, lck, blk, hck, fgr, yrk (reviewed by Bolen et 
al., 1992, FASEB J. 5:3403-3409;, are involved m the 
proliferative and metabolic signal transduction pathways 
also associated with receptor PTKs . Therefore, the 
10 present invention is also directed towards designing 
modulators against this class cf FTKs . For example, 
m.utated src (v-src; is an oncoprotein (pp60 vsrc ) in 
chicken. Moreover, its cellular homolog, the proto- 
oncogene pp60- s - transmits oncogenic signals of many 
15 receptors. For example, over -expression of EGF-R or 

HER2 /neu in tumors leads to the constitutive activation 
cf pp6 0 c " s «, which is characteristic of the malignant 
cell but absent m the normal cell. On the other hand, 
rr.ice deficient for the expression of c-src exhibit an 
osteopetrotic phenotype, indicating a key participation 
cf c-src in osteoclast function and a possible 
involvement in related disorders. Similarly, Zap 7C is 
implicated in T-celi signaling. Both receptor PTKs and 
non-receptor PTKs are connected to hyperimmune 
disorders . 

The instant invention is directed in part towards 
designing modulators of PTK function that could 
indirectly kill tumors by cutting off their source of 
sustenance. Normal vasculogenesis and angiogenesis play 
important roles m a variety of physiological processes 
such as embryonic development, wound healing, organ 
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regeneration ana female reproductive processes suon as 
follicle development in one corous luteum durma 
ovulation and placental growth after pregnancy. Foikman 
and Shing, 1952, J. Biological Chem. 2c 7 : 10931-34 . 
5 However, many diseases are driven by persistent 

unregulated or inappropriate angiogenesis . For example, 
in arthritis, new capillary blood vessels invade the 
joint and destroy the cartilage. In diabetes, new 
capillaries in the retina invade the vitreous, bleed and 

10 cause blindness. Foikman, 1987, m : Congress of 

Thrombosis and Haemoscasis (Verstraete, en. al , eds . ; 
Leuven University Press, Leuven, pp. 583 -596, Ocular 
neovascularization is th^ nnqf rnmm.nn r-anao ^ - wi -; ^ ~ a 
and dominates approximately twenty (20) eye diseases. 

15 Moreover, vasculogenesis and/or angiogenesis can be 

associated with the growth of malignant solid tumors and 
metastasis. A tumor must continuously stimulate the 
growth of new capillary blood vessels for the tumor 
itself to grow. Furthermore, the new blood vessels 

20 embedded in a tumor provide a gateway for tumor cells to 
enter the circulation and to metastasize to distant 
sites in the body. Foikman, 1990, J". Natl. Cancer Inst. 
82:4-G; Klagsbrunn and Soker, 1993, Current Biology 
3:699-702; Foikman, 1991, J. Natl., Cancer Inst. 82:4-6/ 

25 Weiarier et ai . , 1991, New Engl, J, Med. 324:1-5. 

Several polypeptides with in vitro endothelial cell 
growth promoting activity have been identified. 
Examples include acidic and basic fibroblastic growth 
factor iaFGF, 3FGF) , vascular endothelial growth factor 

30 (VEGF) and placental growth factor. Unlike aFGF and 

3FGF , VEGF has recently been reported to be an 
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e: a - ■ ' ' — ^ici. 9ne-. 265:19451-19566. 

Thus, identifying the specific recectors that bind 
^ r '^~ or V £GF is important for understanding endothelial 

cell proliferation regulation. Two structurally related 
receptor PTKs that bind VEGF with high affinity are 
identified: the flt-l receptor (Shibuya et al . , 1990, 
Oncogene 5:519-524; De Vries et al . , 1992, Science 
10 255:989-991, and the KDR/FLK-1 receptor, discussed in 

the U.S. Patent Application No. 08/193,829. In 
addition, a receptor that binds aFGF and BFGF is 
identified. Jaye et al . , 1992, Biochem. Biophys . Acta 
1135:185-199). Consequently, these receptor PTKs most 
15 likely regulate endothelial cell proliferation. 

FGFRs play important roles in angiogenes is , wound 
healing, embryonic development, and malignant 
transformation. Basilico and Moscatelli, 1992, Adv. 
Cancer Res. 59:115-165. Four mammalian FGFR ( FGFRi -4 ) 
20 have been described and additional diversity is 
generated by alternative RNA splicing within the 
extracellular domains. Jaye et al . , 1992, Biochem. 
Biophys. Acta 1135:185-199. Like other receptor PTKs, 
dimerization of FGF receptors is essential for their 
25 activation. Soluble or cell surf ace -bound heparin 

sulfate proteoglycans act in concert with FGF to induce 
dimerization (Schlessinger et al . , 1995, Ceil 53:357- 
360) , which leads to autophosphoryiat ion of specific 
tyrosine residues in the cytoplasmic domain. Mohammadi 
30 et al., 1996, Mol . Cell Biol. 16:977-989. 

Mutations in three human FGF receptor genes, FGFRl , 
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OR 



FC-FR2, ana FG.-R3, nave beer. implicated a vari, 
human generic skeletal disorders. Mutations 
and FGFR2 result m the premature fusion of the flat 
bones of the skull and cause the craniosynostosis 
syndromes, such as Apert ( FGFR2 ) (Wilkie et al . , 1994, 
Nat. Genet. 8:269-274), Pfeiffer (FGFR1 and FGFR2 ) 
(Muenke et al . , 1994, Nat. Genet. 8:269-274), 
Jackson-Weiss ( FGFR2 ; (Jabs et al . , 1994, Nat. Genet. 
8-.21S-219) and Crouzon ( FGFR2 ) (Jabs et al . , 1994, Nat. 
Genet. 3:275-279] syndromes. In contrast mutations m 
FGFR3 are implicated in long bone disorders and cause 
several clinically related forms of dwarfism including 
achondroplasia (Shiang et al . , 1994, Cell 7 8: 335-342!, 
hypochondrcplasia (Bellus et al . , 1995, Nat. Genet. 
15 .10:357-359) and the neonatal lethal thanatophoric 

dysplasia (Tavormina et al . , 1995, Nat. Genet. 9:321- 
328) . It has been shown that these mutations lead to 

.^.v. a^u. j. vaux^n ot the tyrosine Kinase activity 

of FGFR3 (Webster et al . , 1996, EMBO J. 15:520-527). 
Furthermore gene-targeting experiments in mice have 
revealed an essential role for FGFR3 in developmental 
bone formation (Deng et al . , 1996, Cell 84:911-921). 

Another major role proposed for FGFs in vivo is the 
induction of angiogenesis (Folkman and Klagsbrun, 1987, 
-^c^c <.jo.tt*.,. mererore, inappropriate expression 
ot FGFs or of their receptors or aberrant function of 
the tyrosine kinase activity could contribute to several 
human angiogenic pathologies such as diabetic 
retinopathy, rheumatoid arthritis, atherosclerosis and 
tumor neovascularization (Klagsbrun and Edelman, 1989, 
Arteriosclerosis 9:269). Moreover, FGFs are thouaht to 
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~ . , . ^ , - . - _ ^ . , ~. eri^oa.n- -GtRj. ana are 
amplified ir. a population of breast cancers :Adnane et 
al.. 1991, Oncogene 5:653-663;. Over-expression of F3- 
receptors has been also detected m huxan pancreatic 
cancers, astrocytomas, salivary gland ader.osarcomas , 
.-.ape si sarcomas, ovarian cancers and prostate cancers. 

Evidence, such as the disclosure set forth in 
copending U.S. Application Serial No. 08/193,823, 
strongly suggests that VEGF is not only resconsibie for 
endothelial cell proliferation, but also is a prin,e 
regulator of normal and pathological angicgenes is . See 
15 generally, Klagsburn and Soker, 1993, Current Biology 
3:699-702; Houck et al . , 1992, J. Biol. Chew. . 
25-7:26031 -26037 . Moreover, it has been shown that 
K3R/FLK-1 and fit-1 are abundantly expressed in the 
proliferating endothelial cells of a growing tumor, but 
not in the surrounding quiescent endothelial cells. 
Plate et al . , 1992, Nature 355:845-848; Shweiki et al . , 
1992, Nature 355:843-845. 

The invention is directed to designing and 
identifying modulators of receptor and non-receptor PTK 
functions that could modify the inappropriate activitv 
cf a PTK involved with a clinical disorder. The 
rational design and identification of modulators of p^k 
-unctions can be accomplished by utilizing the 
structural coordinates that define a PTK three 
dimensional structure. 



20 



25 
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I I . Modulators cf PTK functions as Therapeutics for 

As a consequence of the disorders discussed above, 
scientists m the biomedical community are searching for 
5 modulators of ?TK functions that down- regulate si anal 

transduction pathways associated with inappropriate PTK 
act ivity . 

In particular, small molecule modulators of PTK 
functions are sought as some can traverse the cell 

10 membrane and do not hydrolyze in acidic environments. 
Some compounds have already been discovered. For 
example, bis monocyclic, bicyclic or heterocyclic aryl 
compounds (PCT WO 92/20642) , vinylerie -azaindole 
derivatives (PCT WO 94/14308) 1 - eye iopropyl - 4 -pyridyl - 

15 quinolones (U.S. Patent No. 5,330,992), styryl compounds 
(U.S. Patent No. 5,217,999), styryl - substituted pyridyl 
compounds (U.S. Patent No. 5,302,606), certain 
quinazoline derivatives (EP Application No. 0 566 266 
Al ) . seleoindoies and selenides (PCT WO 94/03427} , 

20 tricyclic polyhydroxylic compounds (PCT WO 92/21660) , 
and benzylphosphonic acid compounds (PCT WO 91/15495) 
are described as PTK inhibitors. 

Although some modulators of PTK function are known, 
many of these are not specific for PTK subfamilies and 

25 will therefore cause multiple side-effects as 

t hiol indci mone family., however, are specific for the 
FGF receptor subfamily (U.S. Patent Application Serial 
No. 08/702,232, filed August 23, 1996, invented by Tang 
30 et al . , entitled "Indolincne Combinatorial Libraries and 

Related Products and Methods for the Treatment of 



BNBOOCID: <WO »607836A2JA> 



WO 98/07835 



PCT7US97/14885 



C a : q ^ 



~ _ b ~ a 



aaai ' 



compounds of :he oxmdcl ir.one tnicl mdclmone faT.ilv are 
r.cr.-r.yaro.yzacie ir. acidic ccnuicicns and can ce hicrhlv 
c i cava i_ able . 

The invention provides information regarding the 
specific interactions between a ?TK and compounds of the 
oxindclinone/thiolmdolinone family. Although the use 
or X-ray crystallography has provided three dimensional 
structures of other ?TKs, the PTKs in these structures 
are not corr.plexed with PTK subfamily specific, 
hydrolysis resistant, highly bioavailable small 
molecules. The X-ray crystallography techniques used in 
tne current invention resolve interactions between a PTK 
and compounds m complex with it at the atomic level, 
which provides detailed information reaardma the 
orientation of chemical groups defminq an effective 
modulator of PTK function. 



11 1 • Crystalline Tyrosine Kinases 

Crystalline PTKs of the invention include native 
crystals, derivative crystals and co-crystals. The 
native crystals of the invention generally comprise 
substantially pure polypeptides corresponding to the 
tyrosine kinase domain in crystalline form.. 

It is to be understood that the crystalline 
tyrosine kinase domains of the invention are not limited 
to naturally occurring or native tyrosine kinase 
domains. Indeed, the crystals of tne invention include 
mutants of native tyrosine kinase domains. Mutants of 
native tyrosine kinase domains are obtained by replacing 
at .east one amino acid residue in a native tyrosine 
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Kinase aonair. wi^h a different ar.ir.c acid residue, 3 r b v 
adding or deleting anno acid residues within the native 
polypeptide or at the X- or C~ terminus cf the native 
polypeptide, and nave substantially the same three - 
5 dimensional structure as the native tyrosine kinase 

domain from which the mutant is derived. 

By having substantially the same tnree -dimens icnal 
structure is meant having a set of atomic structure 
coordinates that nave a root -mean-square deviation of 

10 less than or equal to about 2A when superimposed with 

the atomic structure coordinates of the native tyrosine 
kinase domain from which the mutant is derived when at 
least about 50% to 10 0% of the Ca atoms of the native 
tyrosine kinase domain are included m the 

15 superposition. 

Amino acid substitutions, deletions and additions 
which do not significantly interfere with the three- 
dimensional structure of the tyrosine kinase domain will 
depend, in part, on the region of the tyrosine kinase 

2 0 domain where the substitution, addition or deletion 
occurs. In highly variable regions of the molecule, 
such as those shown in FIG. 6, non-conservative 
substitutions as well as conservative substitutions may 
be tolerated without significantly disrupting the three- 

2 5 dimensional structure ot the molecule. In highly 

conserved regions, or regions containing significant 
secondary structure, such as those regions shown in FIG . 
6, conservative amino acid substitutions are preferred. 
Conservative amino acid substitutions are well- 

30 known in the art, and include substitutions made on the 
basis of similarity in polarity, charge, solubility, 
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acid residues 



negatively marged ammo acid 



n c xUce asoartic 



acid and glutamic acid; positively charged a-mc acids 
mciuae :ysme and arginme; ammo acids with uncharged 
polar head groups having similar nydrophilicicy values 
include the following: leucine, isoleucme, valine; 
glycine, alanine; asparagme, glutamme; serine, 
threonine; phenylalanine, tyrosine. Other conservative 
ammo acid substitutions are well known m the art. 

For tyrosine kinase domains obtained m whole or m 
part by chemical synthesis, the selection of ammo acids 
available for substitution or addition is not limited :o 
the genetically encoded ammo acids. Indeed, the 
mutants described herein mav ccnt=-~ _^o^«- i - 

encoded ammo acids. Conservative ammo acid 
substitutions for many of the commonly known non- 
genetically encoded amino acids are well known in the 
art. Conservative substitutions for other amino acids 
can be determined based on their physical properties as 
compared to the properties of the genetically encoded 
amino acids . 

In some instances, it may be particularly 
advantageous or convenient to substitute, delete and/or 
add ammo acid residues to a native tyrosine kinase 
domain m order to provide convenient cloning sites in 
cDNA encoding the poiypepcide, to aid m purification of 
the polypeptide, and for crystallization of the 
polypeptide. Such substitutions, deletions and/or 
additions which do not substantially alter the three 
dimensional structure of the native tyrosine kinase 
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apparent :: :r.ose of ordinary skiII in 

the arc . 

It snould be noted that the mutants contemplated 
herein need not exhibit PTK activity. Indeed, ammo 
acid substitutions, additions or deletions that 
interfere with the kinase activity of the tyrosine 
kinase domain but which do not significantly alter the 
three-dimensional structure of the domain are 
specifically contemplated by the invention. Such 
crystalline polypeptides, or the atomic structure 
coordinates obtained therefrom, can be used to identify 
compounds that bind to the native domain. These 
compounds may affect the activity or the native domain. 
The derivative crystals of the invention generally 
15 comprise a crystalline tyrosine kinase domain 

polypeptide in covalent association with one or more 
heavy metal atoms. The polypeptide may correspond to a 
native or a mutated tyrosine kinase domain. Heavy metal 
atoms useful for providing derivative crystals include, 
20 by way of example and not limitation,, gold, mercury, 
etc . 

The co-crystals of the invention generally comprise 
a crystalline tyrosine kinase domain polypeptide in 
association with one or more compounds. The association 

2 ^ n~\Z*\r Vi O r-mra 1 nnf /— « -w- „ i ^ ^ _ _ _ i _ 

. . j _ „ ^ * w*. — * * w w j_ a j. ^ a i " a i^iil . l-> 1 1 <. : i < < ) P"- r. ? <_> i_t n_ CX S 

include, but are not limited to, cofactors, substrates, 
substrate analogues, inhibitors, allosteric effectors, 
etc . 
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transcript: icnal / t ransia: icnal control signals . These 
met nods include m vi vro recombinant DNA techniques , 
synthetic techniques and in nvo re comb mat ion ./ genet ic 
recombination . See, for example , the techniques 
5 described in Maniatis ec al . , 1983 and Ausubel er al . , 
1989 . 

A variety of host-expression vector systems may be 
utilized to express the tyrosine kinase domain coding 
sequence . These include but are not limited to 

10 microorganisms such as bacteria transformed with 

recombinant bacteriophage DNA, piasmid DNA or cosmid DNA 
expression vectors containing the tyrosine kinase domain 
coding sequence; yeast transformed with recombinant 
yeast expression vectors containing the tyrosine kinase 

15 domain coding sequence; insect ceil systems infected 
with recombinant virus expression vectors ( e.g. , 
baculovirus ) containing the tyrosine kinase domain 
coding sequence; plant cell systems infected with 
recombinant virus expression vectors ( e.g. , cauliflower 

2 0 mosaic virus, CaMV; tobacco mosaic virus, TMV) or 

transformed with recombinant piasmid expression vectors 
( e.g. , Ti piasmid) containing the tyrosine kinase domain 
coding sequence; or animal cell systems. The expression 
elements of these systems vary in their strength and 

2 5 specificities . 

Depending on the host /vector system utilized , any 
of a number of suitable transcription and translation 
elements, including constitutive and inducible 
promoters, may be used m the expression vector. For 

30 example, when cloning in bacterial systems, inducible 

promoters such as pL of bacteriophage X, piac, ptrp, 
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s u c n as c :~ ~ cacuiovirus p-clvhedrin o r c rr, 
when clomna :r r. ~ 



--L^.^i.^ i.. ce.. systems, proTio:ers derived 

frOTT1 zhe genome of plan- cells ( e.g. . neat shock 
promoters,- the promoter for the small subunit cf 
RUB I SCO; the promoter for the chlorophyll a/b binding 
protein; cr from plant viruses - e.g. . the 35S RNA 
promoter of CaMV; the coat protein promoter of TMV; may 
be used; when cloning m mammalian cell systems, 
promoters derived from the genome of mammalian cells 
( g • g • > nietallothionein promoter) or from mammalian 
viruses (s^cu, the adenovirus late promoter; the 
vaccinia virus 7.5K promoter) may be used; when 
generating cell lines that contain multiple copies of 
the tyrosine kinase domain DNA , SV40-, BPV- and EBV- 
based vectors may be used with an appropriate selectable 
marker . 

Methods describing methods of DNA manipulation, 
vectors, various types of cells used, methods of 
incorporating the vectors into the cells, expression 
techniques, protein purification and isolation methods, 
and protein concentration methods are disclosed m 
detail with respect to the protein PYK-2 in PCT 
publication wo 96/18736. This publication is 
incorporated herein by reference in its entirety, 
including any drawings. Those skilled in the art will 
appreciate that such descriptions are applicable to the 
present invention and can be easily adapted to it. 
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r^ysLdl Growth 

Crystals are grown from an aqueous solution 
containing the purified and concentrated polypeptide oy 
a variety of techniques. These techniques include 
5 catch, liquid, bridge, dialysis, vapor diffusion, and 

hanging drop methods. Mcpherson, 1982, John Wiley, New 
York; McPherson, 1990, Eur. J. Biochem. 189:1-23; 
Webber, 1991, Adv. Protein Chem. 41:1-36, incorporated 
by reference herein in its entirety, including all 
10 figures, tables, and drawings. 

Generally, the native crystals of the invention are 
grown by adding precipitants to the concentrated 
solution of the polypeptide corresponding to the PTK 
catalytic domain. The precipitants are added at a 
15 concentration just below that necessary to precipitate 

the protein. Water is removed by controlled evaporation 
to produce precipitating conditions, which are 
maintained until crystal growth ceases. 

For crystals of the invention, it has been found 
20 that hanging drops containing about 2.0 uL of tyrosine 

kinase domain polypeptide (10 mg/mL in lOmM Tris-HCl, pH 
8.0, 10 mM NaCl and 2 mM dithiot hrei tol ) and 2.0 /iL 
reservoir solution (16% w/v polyethylene glycol MW 
10000, 0.3 M (NH 4 ) 2 S0 4 , 5% v/v ethylene glycol or 

i i — -u-;^ rp^.;^ t^u c ^ \ GiiQnpnHpH ove T 0-5 

; r j i \y c ~ <-* r ( j i din i i u u nii'i j. o ^ j — t m * * [~ — - - 

*- — — j. - - - 

tt.L reservoir buffer for about 3-4 weeks at 4°C provide 
crystals suitable for high resolution X-ray structure 
determination . 

Those of ordinary skill in the arc will recognize 
30 that the above - described crystallization conditions can 
be varied. Such variations may be used alone or m 



BNSDOCID: <WO &807836A2_IA> 



WO 98/07835 



PCT7US97/14885 



15 



_ wi ._»_n^ng r - -*/pcpt icte _;_er.;ra::or.s betiweer. about 1 
. . . . . . : ^ ct * d ^ _ i_ m g , ■ ttu- , -ris-HCT_ ccr^csr ^ v - " ^ ors 

ct>__ _ . ;: .«. a c, Ju ^ m ^ ( aitnict r.rei tol 

concentrations between about 0 mM and about 20 mM , pH 
ranges between about 5.5 and about -.5; and reservoir 
solutions containing polyethylene glyool concentrations 
between about 10% and about 30% fw/v; , polyethylene 
glycol molecular weights between about 1000 and about 
2 ^' 000 ' -NH;- 2 SC 4 concentrations between about 0.1 v and 
about 0.5 M, ethylene glycol or glycerol concentrations 
oetween about 0% and about 20% (v/v) , bis-Tris 
concentrations between about 10 mM and about 200 mM , pH 
ranges between about 5.5 and about 7.5 and temperature 
ranges between about 0° C and about 25 °C. Other buffer 
solutions may be used such as HEPES buffer, so long as 
one desired pH range is maintained. 

Derivative crystals of the invention can be 
Obtained by soaking native crystals m mother liquor 
containing salts of heavy metal atoms. It has been 
found that soaking a native crystal in a solution 
containing about 0.1 mM to about 5 mM thimerosal, 4- 
chloromeruribenzoic acid or KAu(CN) 2 for about 2 hr to 
about 72 hr provides derivative crystals suitable for 
25 use as isomorphous replacements in determining the X-ray 
crystal structure of the tyrosine kinase domain 
po lypept ide . 

Co-crystals of the invention can be obtained by 
soaking a native orystal m m.other liquor containing 
3 0 compound that bind the kinase domain, or described 

above, or can be obtained by co- crystal iizmg the kinase 



20 
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domain polypeptide in the cresence — v-~ 

binding compounds . 

For co-crystals of tyrosine kinase domain 
polypeptide in co-complex with AMP -PC?, it ha- be<=^ 
5 found that co-crystallizing the kinase domain 

polypeptide in the presence of AMP-PCP using the above- 
described crystallization conditions for obtaining 
native crystals with a polypeptide solution additionally 
containing 10 mM AMP-PCP and 20 mM MgCl 2 yields co- 
10 crystals suitable for the high resolution structure 
determination by X-ray crystallography. Of course, 
those having skill in the art will recognize that the 
concentrations of AMP-PCP and MgCl 2 in the polypeptide 
solution can be varied, alone or m combination with the 
15 variations described above for native crystals. Such 

variations include polypeptide solutions containing AMP- 
PCP concentrations between 0.1 mM and 50 mM and MgCl, 
concentrations between 0 mM and 50 mM. 

Crystals comprising a polypeptide corresponding to 
a PTK catalytic domain cotupkxed with a compound can be 
grown by one of two methods. In the first method, the 
modulator is added to the aqueous solution containing 
the polypeptide corresponding to the PTK catalytic 
domain before the crystal is grown. In the second 
method, the modulator is soaked into an already existing 
crystal of a polypeptide corresponding to -a PTK 
catalytic domain . 



20 
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:r. one illustrative embodiment, the invent 10- 
les :rysta.s of FGFRl . The crystals -were obtained 
by the net hods provided in the Examples . The FGFRl 
5 crystals, which may be native crystals, derivative 

crystals or co-crystals, have mcnociir.ic unit ceils 

unit ceils wherein a/b^c; a=y=90 o ; and B>90°: and 
space group symmetry C2 . There are twc FGFRl molecules 
in the asymmetric unit, related by an approximate two- 
10 fold axis . 

Two forms of crystalline FGFRl were obtained. In 
one form, (designated "C2-A form"), the unit ceil has 
dimensions of a=208.3 A, b=57.2 A, c=65.5 A and 
3=107.2°. In another form (designated "C2-8 form";, the 
15 unit cell has dimensions of a=211.6 A, b=51.3 A, c=66.l 
A and 6=107.7°. 

Three distinct two-fold related FGFRl diners are 
observed m both the C2-A and C2-B forms of the FGFRl 
crystal, one non-crystallographicaiiy related dimer and 
two crystallographically related dimers . The non- 
crystailographically related dimer comprises the two 
molecules in the asymmetric unit. The residues making 
up the dimer interface are located in C-terminal lobe. 
In this dimer, the C-terminai icbes abut with the N- 
2 5 terminal lobes distal to one another. The total amount 
of surface area buried in the surface is about 950 A 2 . 
Very few of the interactions m the interface are of a 
specific nature, e.g.,, hydrogen - bonding or close packing 
of hydrophobic residues. 

There are two crystallographically-related dimers 
the C2 lattice. in the first dimer, the residues 
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that constitute :ne oin^r _.._e^«~- — 
m the 3-sheet of tne X-termmal looe a:::- acia 
residues 477, 479, 496, 506, 503 and 496;. Tne total 
surface area buried in this interface is about 670 A- . 

^^•h^ qnpri'K Three hvaroohcDic 
The interactions are rather specui.. 

■ - u ■ ^ -vo nar-iaUv S 0 1 V6H t - 6XD O 3 6 d in the 

residues whicn are par-ia-iy 

monomer, Vai-479, He-493 and Val-508, come together 
with their two-fold-related residues to form a compact 

, • -, Th-Q -lua capped on either side by 

hydrophobic plug, xn^s yiag - = La ff 

a' salt bridge between Arg-477 and Glu-496. In addition, 
two main-chain hydrogen-bonds connect the ^-sheets of 

.- -v,^ o-a>-r of S3 (amino acid residues 
the two monomers at tne Start or ^ 

506 and 508). The residues in this dimer interface, or 
their residue character, are generally conserved ir. the 
15 mammalian FGF receptors, but not in the invertebrate 

homologues . 

The other crystaliographically-related dimer buries 

• • ■ 4-„«.-p = -o rVii o dimer. the aC 

about 1650 A' in its interface. — — — 

helices of the two monomers are nearly parallel and 
contact each other at their C- terminal ends. Met-534 
and Met-537 are in van der Waals contact with their two- 
fold-related residues. Other hydrophobic contacts 
involve Pro-466 with Ile-648 and Pro-469 with Ile-676 
and Thr-678. In addition, hydrogen bonds (side-chair, to 

v 7\rrr.A7ri ^ nd Lvs - 6 1 8 and 

2b mdili - Liicaj.ii ; ot^- 

... C „ Q r,ln-4S4, and there are several water 

iLweeu n^o-'-j-i-^ 

.ecules that bridge the two monomers through hydrogen 



20 



oei 
mo" 



bonding . 

in tne C2-B form of the crystal, uhe ,~ 

30 this second crystaliographically-related dimer are 
shifted slightly with respect to one another (6° 



BNSOOCID <WO 9807636A2JA> 



WO 98/07835 



PC. 14885 



:ace is scrr.ewha* 



•r ... cx _ e : 

^ ..^.^ _ _ iA e ^v;c nc.ecuies cor ; pr:sina tine dip.' 

point: in the sarr.e direction and are reasonably close 
one another . 



j e ti e r m l n i n c 



Dimensions and the Thr* 



Jingngional Structure of a Pol ypeptide or PGlvp pp^rio 
lomnlex 



Once :he crystal is grown, it can oe placed m a 
giass capillary tube and mounted onto a holding device 
connected to an X-ray generator and an X-ray detection 

15 device. Collection of X-ray diffraction patterns are 

well documented by those in the art. Ducruix and Geige, 
1992, IRL Press, Oxford, England, and references cited 
therein. A beam of X-rays enter the crystal and then 
diffract from the crystal. An X-ray detection device 

20 can be utilized to record the diffraction patterns 

emanating from the crystal. Although the X-ray 
detection device on older models of these instruments is 
a piece of film, modern instruments digitally record X- 
ray diffraction scattering. 

25 Methods for obtaining the three dimensional 

structure of the crystalline form of a peptide molecule 
or molecule complex are well known m the- art. Ducruix 
and Geige, 1992, IRL Press, Oxford, England, and 
references cited therein. The following are steps in 

3 0 the process of determining the three dimensional 

structure of a molecule or complex from X-ray 
diffraction data. 
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After the X-ray diffraction patterns are collected 
from the crystal, the unit cell dimensions and 
orientation m the crystal can be determined. They can 
be determined from the spacing between the diffraction 
5 emissions as well as the patterns made from, these 

emissions. The unit ceil dimensions are characterized 
m three dimensions in units of Angstroms (one A = 10" 1: 
meters) and by angles at each vertices. The symmetry of 
the unit cell in the crystals is also characterized at 
10 this stage. The symmetry of the unit cell in the 

crystal simplifies the complexity of the collected data 
by identifying repeating patterns. Application of the 
symmetry and dimensions of the unit ceil is described 
below . 

15 Each diffraction pattern emission is characterized 

as a vector and the data collected at this stage of the 
method determines the amplitude of each vector. The 
phases of the vectors can be determined using multiple 
techniques. In one method, heavy atoms can be soaked 

2 0 into a crystal, a method called isomorphous replacement, 
and the phases of the vectors can be determined by using 
these heavy atoms as reference points in the X-ray 
analysis. Otwinowski, 1991, Daresbury, United Kingdom, 
80-86. The isomorphous replacement method usually 
c i r 'j. 1 1 e s more them one heavy atom derivative In 

another method, the amplitudes and phases of vectors 
from a crystalline polypeptide with an already 
determined structure can be applied to the amplitudes of 
the vectors from a crystalline polypeptide of unknown 

30 structure and consequently determine the phases of these 
vectors. This second method is known as molecular 
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reference mus" - 

^ — - - - — - — - - ■ jarszci , - ^ ^ , r^^r 3C6ids J. 2. ■ 2 ^ " - " ^ ^ 

Thus, the vector information fror. a PTK of known 
structure, such as those reported herein, are useful for 
the molecular replacement analysis of another PTK with 
unknown structure. 

Once the phases of tne vectors describing the unit 
cell cf a crystal are determined, the vector amplitudes 
and phases, unit cell dimensions, and unit cell symmetry 
can ce used as terms in a Fourier transform function. 
The Fourier transform function calculates the electron 
density in the unit cell from these measurements. The 
electron density that describes one of the molecules or 
one cf the molecule complexes in the unit cell can be 
referred to as an electron density map. The ammo acid 
structures of the sequence or the molecular structures 
of compounds complexed with the crystalline polypeptide 
may then fit to the electron density using a variety of 
computer programs. This step of the process is 
sometimes referred to as model building and can be 
accomplished by using computer programs such as 

TOM/FRODO. Jones, 1985, Methods in Enzymology 115:151- 
171 . 

A theoretical electron density map can then be 
calculated from the amino acid structures fit to the 
experimentally determined electron density. The 
theoretical and experimental electrcn density maps can 
be compared to one another and the agreement between 
these two maps can be descrioed by a parameter called an 
R-f actor. A low value for an R-factor describes a high 
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degree of overlapping electron density between a 
theoretical and experimental electron density map. 

The ^-factor is then minimized by using computer 
programs that refine tne theoretical electron density 
5 map. A computer program such as X-PLOR can be used for 

model refinement by those skilled m the art. Brunger , 
1992, Nature 355:472-475. Refinement may be achieved in 
an iterative process. A first step can entail altering 
the conformation of atoms defined m an electron density 

10 map . The conformations of tne atoms can be altered by 

simulating a rise m temperature which will increase the 
vibrational frequency of the bonds and modify positions 
of atoms in the structure. At a particular point m the 
atomic perturbation process, a force field, which 

15 typically defines interactions between atoms in terms of 

allowed bond angles and bond lengths, Van der Waals 
interactions, hydrogen bonds, ionic interactions, and 
hydrophobic interactions, can be applied to the system 
of atoms. Favorable interactions may be described in 

20 terms of free energy and the atoms can be moved over 

many iterations until a free energy minimum is achieved. 
The refinement process can be iterated until the R- 
f actor reaches a minimum value. 

The three dimensional structure of the molecule or 

25 molecule complex is described by atoms that fit the 

theoretical electron density characterized by a minimum 
R-value. A file can then be created for the three 
dimensional structure that defines each atom by 
coordinates in three dimensions. Examples of sucn 

30 structural coordinate files are defined in Table 1, 

Table 2, Table 3, and Table 4. 
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The present invention provides high- resolution 
three- dimensional structures ana atomic structure 
coordinates of crystalline FGFRl and crystalline 
FGFRl :AMF-?C? cc-ccmpiex as determined by X-ray 
crystallography. The specific methods used to obtain 
tne structure coordinates are provided m the examples. 
Tne atomic structure coordinates of crystalline FGFRl , 
obtained from the C2-A form of the crystal to 2.0 A 
resolution, are listed in Table 3; the coordinates of 
crystalline FGFRl : AM?- PCP co-complex, obtained from the 
C2-A form of the crystal to 2.3 A resolution are listed 
in Table 4 . 

Those having skill in the art will recognize that 
atomic structure coordinates as determined by X-ray 
crystallography are not without error. Thus, it is to 
oe understood that any set of structure coordinates 
obtained for crystals of FGFRl , whether native crystals, 
derivative crystals or co-crystals, that have a root 
mean square deviation ( " r . m . s . d . » ) of less than or equal 
to about 1.5 A when superimposed, using backbone atoms 
(N, C a , C and O) , on the structure coordinates listed in 
Table 3 or Table 4 are considered to be identical with 
25 the structure coordinates listed in the Tables when at 
least about 50% to 100% of the backbone atoms of FGFRl 
are included in the superposition. 

Referring now to FIG. 1, the overall structure of 
FGFRl is ci-lobate. The N-terminal lobe of FGFRl spans 
3 0 ammo acid residues 456-567 (FIG. 3; and comprises a 

curled 3-sheet of five ant i -parallel strands (Bl-35. and 



20 
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- , ■ ^---v- n -na' lobe scans anr.rnc acia 
one a-neux .a^, . - i-.ia- 

residues 566 - 755 (FIG. 3 and comprises owe 3-soraods 
(37, B8) and seven ot-helices iaD , aE, aEF , a?-al). Tne 
secondary structure ncnienc.a.u: C _w„w** ----- —-a 
5 IRK (Hubbard et al . , 1954 which in turn is cased or. tne 
assignments for cAPK (Knighton et al . , 199-)- FIG . 2 
shows a stereo view of a a, trace of FGFR1 in the same 

orientation as FIG. 1. 

A structure-based sequence alignment of the 
1C tyrosine kinase domains of human fibroblast growth- 
factor receptor 1 (human FGFR1 ; labelled FGFR1 ) , human 
fibroblast growth factor receptors 2 , 3 and 4 (labelled 
FGFR2 , FGFR3 and FGFR4 , respectively), a D . melanogastez 
homologue (labelled DFDFR1 ) , a C elegans homologue 
15 (labelled EGL-15) and insulin receptor kinase (labelled 
IRK), is shown in FIG. 3. The sequence of FGFR1 , which 
is not shown in FIG. 3 is identical to the sequence of 
FGFR1 except that FGFR1 has the following amino acid 
substitutions and additions: Cys-488 - Ala, Cys-584 - 
20 Ser, Leu-4 57 - Val and an additional five N- terminal 
amino acids (Ser-Ala- Ala-Gly-Thr) . The secondary 
structure assignments for FGFR1 and IRK were obtained 
using the Kabsch and Sander algorithm (Kabsch and 
Sander, 1983) as implemented in PROCHECK (Laskowski et 

. _ r^nr? v-nnnnmr c; p> m ip.nces , a period 

- — -, ' iou«(> ' t , • i L*-t r <-3 l~ j-w^^-*^ — tL- 

Z 3 Cl-L . , -l. -< ~> -> , • 

represents sequence identity to FGFR1 . In the IRK 
sequence, residues that are identical to FGFR1 are 
highlighted. A hyphen denotes an insertion. 

The numbers under tne El^-j-d seq^e^e ^^-s_.._ 
30 fractional solvent accessibility ( FSA2 ) of the residue 
in the FGFR1 structure. The FSA ratio is the ratio of 
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toe s^.ven: ac^~^^_- i-r:a:e area a residue :r. a 

3iy-X-G_y :r:pep::ae compared :: :r.a: m :r.e FGFR1 
structure. A value cf C represents an FSA cetween 2 . G G 
and ^-^5; I represent an FSA cetween G . 1 0 and G.19, 
5 e:c. The higher tine value, the more sol vent - exposed the 

residue. An asterisk or pound sign in the FSA line 
indicates tnat the residue 'asterisk' cr side chain 

pound sign) is not included m the atcn model due to 
disorder. The numbers below the FSA line are the FSAs 
10 for those residues that form, part of a dimer interface. 

The amino acid residue numbers for FGFR1 , and hence 
FGFR1 , and IRK provided in FIG. 3 are used m the 
discussion that follows. Significant differences m the 
N-terminal lobe of FGFRl as compared to IRK are found in 
15 the loops between 3 strands and m aC . Residues from the 

end of Si through the beginning of 32 (ammo acid 
residues 485-490) form the nucleotide -binding loop, 
named because of its role in AT? coordination. This 
residue stretch contains the protein kinase - conserved 
20 GXGXXG sequence motif, where X is any amino acid. This 
loop is poorly ordered in one FGFRl molecule in the 
asymmetric unit and disordered ( i.e. . not included in 
the atomic model) in the other FGFRl molecule in the 
asymmetric unit. The loop between 31 and S3 is 
25 disordered m both FGFRl molecules comprising the 

asymmetric unit . 

Referring now to FIG. 4A, which provides a ribbon 
diagram of the N-terminai lobes cf FGFRl and IRK in 
which the C a atoms of the 3 -sheets have been 
superimposed, it can be seen tnat m FGFRl aC is lonper 
by one helical turn than in IRK and is oriented such 



3 0 
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that residues Lys-514 and Glu-531, wh;cr. are conserved 
m protein kinases, form a sale bridge ..represented by a 
black line) . While net intending to be bound by theory, 
this salt bridge is believed to be important for proper 
5 positioning of the conserved lysine side chain, which 
coordinates two phosphate oxygens of ATP. The salt 
bridge is observed in the structures of cA?K (Knighton 
et al . , 1991) and mi togen - act ivat ed protein kinase 
(MAPK) (Zhang et al . , 1994). 

10 Referring now to FIG. 43, which provides a ribbon 

diagram of the C-ternmal lobes of FGFR1 and IRK in 
which the C a atoms of the a-helices have been 
superimposed, a significant difference is found in the 
C-terminal helix of FGFR1 when compared to IRK; helix ocl 

15 of FGFR1 is longer by seven residues (two helical turns) 
than its counterpart in IRK. The extended length of ctl 
is presumably important in the biological functioning of 
FGF receptors, since the tyrosine autophospnorylation 
site to which an SH2 domain of PLCy binds is six 

2 0 residues C- terminal to tnis henx. 

The structure of FGFR1 displays an open disposition 
of the N- and C-terminal lobes. Despite having 
different sets of lattice contacts, the two FGFR1 
molecules in the asymmetric unit have only a 2° 

2 5 difference m relative lobe orientation. - t appears as 
though the stearic interaction between residues in aC 
(Glu-531 and Met-534) with Phe-642 and Gly-643 of the 
protein kinase - conserved DFG sequence at the beginning 
of the activation loop accounts for the open 

30 conformation of FGFR1 . 

The active site of FGFR1 is characterized by at 
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a *' a --^w^-^^^j — — *^ i . * u u _ o c c . ^ n _ i e in ^ 

structure of IRK, m wr^" ^y^--"^' - s « ^ ^ ~ „^_, rc 

site of the molecule, the active sites of both FGFRl 
molecules m the asymmetric unit are unoccupied. 

The activation loop, which regulates 
phosphorylation, is characterized by at least resides 
640 to 663. Quite surprisingly, while the activation 
loops of FGFRl and IRK contain the same number of ammo 
acid residues and share greater than 50% sequence 
homology, the paths of the polypeptide chains are 
strikingly dissimilar, diverging at Ala-640 (GIy-1149 in 
IRK) and reconvergmg at Val-664 (Val-1173 in IRK; 
Tyr-653 and Tyr 564 are not oound in the active site. 
Instead, these residues point away from it. Tyr-653 is 
in van der Waals contact with several hydrophobic 
residues (Val-664, Leu-672 and Phe-710) and is hydrogen- 
bonded via its hydroxyl group to a backbone carbonyl 
oxygen (Leu-672) . Tyr-654 is more solvent exposed than 
Tyr-653, and its only van der Waals contact is with Val- 
706. Temperature factor data suggest that the 
activation loop is relatively mobile and adopts multiple 
conformations . 

The catalytic loop of protein kinases lies between 
secondary structure elements ocE and 37 and contains an 
invariant aspartic acid residue (Asp-623 in FGFRl) which 
serves as the catalytic base in the phosphotransf er 
reaction, abstracting the proton from the hydroxyl group 
of the substrate tyrosine, serine or threonine. The 
catalytic loop sequence of FGFRl comprises at least 
residues His - 6 2 1 to Asn-62S (amino at id seauence 



BNSDOCID: <WO 9807836A2_IA> 



WO 98/07835 



PCT/US97 14885 



HRTLAARN, , and is identical to :na: for ZRK and t.os: 
receptor and non-receptor ?TKs . 

In addition to the two tyrosine aut ophosphory lat ion 
sites m the activation loop (Tyr-653 and Tyr-654', 
5 there are four other aut ophosphorylat ion sites present 

m the FGFR1 crystals of the invent ion : one m the 
j uxtamembrane region ( Tyr- 4 6 3), two in the kinase insert 
(Tyr-583 and Tyr-585) and one in the C- terminal lobe 
(Tyr-730) (Mohammadi et al . , 1996). They exhibit 
1 0 varying degrees of conservation in mammal i an FG? 

receptors: Tyr-463 and Tyr-585 in FGFR1 and 2; Tyr-583 
in FGFR1, 2 and 3; and Tyr- 73 0 in FGFR 1, 2, 3 and 4 
{ r _lo . 3 ) . 

Referring now to FIG. 5, the positions of the 

15 aut ophosphory lat ion sites are mapped onto the FGFR1 
structure. The j uxtamembrane site (Tyr-463) and the 
residues N - terminal to it are disordered i n one of the 
FGFR1 molecules in the asymmetric unit. In the other 
molecule in the asymmetric unit Tyr-463 is involved in a 

20 lattice contact. 

The kinase insert region (the region between 
helices aD and aE) contains autophosphorylat ion sites 
Tyr-583 and Tyr-585 and is disordered in both FGFR1 
molecules in the asymmetric unit of the C2-A form of the 

25 uiybLdi . In Li it; C2-3 lux ml , aeveidl idLLice uuntac t s 

partially pin down this region in one of the two FGFR1 
molecules in the asymmetric unit, allowing a trace of 
the polypeptide chain to be made . There is no well - 
defined secondary structure for these residues. Tyr - 

30 73 0, situated in aH in the C- terminal lobe , is nearly 
buried and the s ide - chain hydroxy 1 group makes two 
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r.ydroger.-hcnds. The side rr.air.s c: ne i ghtcr m - Me:-3Z 
ana Xet--32 are both buneu. Therefore, cr.oschcryla: ::r. 
or Tyr- 7 30 would presumably require prior ur.fcldmc of 





Ot'ri . 



Aside from Tyr-^BO, the five other 
autophosphcrylatior. sices 'including Tyr- 653 and Tyr- 
654) are found m relatively mobile segments of the 
FGFR1 molecule. While not intending to be bound by 
theory, the spatial positions of the autophosphoryiat ion 
sites relative zc the active site suggest that 
autcpnosphoryiation occurs by a trahs mechanism between 
two kinase domains, supporting the hypothesis that 
ligand- induced receptor dimerization is critical for the 
initiation of autophosphoryiat ion events. 

The structure of crystalline FGFR1 : AMP -PC? co- 
complex is essentially similar to that observed for 
crystalline FGFR1 . There are no significant changes m 
the structure of FGFR1 induced by AMP-PCP binding. In 
particular, binding of AMP-PCP, and by extension ATP, 
does not by itself promote lobe closure under the 
crystallization conditions used. Furthermore, 
complexation did not result in any noticeable changes m 
the conformations of the activation and nucleotide- 
binding loops. 

The crystalline FGFR1 : AMP - ?CP co-complex contains 
hydrogen bonds that are present between Nl of adenine 
and the amide nitrogen of Ala-564 and between N6 of 
adenine and the carbonyi oxygen of Glu-562. The adenine 
ring is flanked on one side by Leu-484 and Val-492 (N- 
termmai lobe:.' and on the other side by L.eu-630 

:erminal lobe; . The ribose hydroxy 1 groups make no 



f . r - - - 
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direct hydroger. bonds with pro:e:n acorns. lys-514 is 
hydrogen-bonded :o oxygens of tne 3- and y-phcsphates . 
There is no unambiguous electron density that would 
indicate the positions of Mg : - ions. Generally, AMP-PCP 
appears to be coordinated rather loosely to 
unpnosphorylated F3FR1 , being bound to the "roof" of the 
cleft rather than being tightly sandwiched between the 
two kinase lobes. 

Structural Difference Bstwssn FGF-R and T3K 

Several features distinguish the FGF-receptor 
structure from that of the insulin.- receptor tyrosine 
kinase. These distinctions are likely to be important 
in signaling by FGF-receptors, and other monomeric 
receptors that are believed to undergo iigand- induced 
dimerizat ion . 

The most significant: difference between the 
structures of FGFR1 and IRK 1S i~he -nr - ~-r~ - - — ^ 
activation loop. In FGFR1 , the activation loop is 
20 disposea such that the binding site for substrate 

peptides is blocked not by an activation loop tyrosine, 
as in IRK, but by Arg-661 and PTK- invariant Pro-663, 
while the ATP binding site is accessible. This 
represents another molecular mechanism by which a 

25 recsnrnr ptk maw =i h- ^ •„ j _ _ j 

^ - - - -• — ~- / — — y*^uwiiiiiiuiLca. ; ■ ■ ^ 'jzj s - r ve c. 

autoinhibition in FGFR1 would appear to be weaker than 

that in IRK because of fewer specific interactions made 

by residues in the FGFR1 activation loop ; manifested m 

the relatively higher 3-values) and the accessibility of 

30 the ATP site. One obvious distinction between the 

insulin and FGF receptor families is that m the former, 
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whereas in :r.e latter, receptor d:-r:za:::n is ligand 
dependent . Receptors whose kinase detains are always m 
c^ose proximity nay require a stronger aut omhibi t ion 
mechanism, than those receptors that associate only upon 
ligand binding 'Taylor et al . , 1995). Since most growth 
factor receptors undergo 1 igand- dependent dimerization 
and activation, the FGF receptor auto inhibit i on 
mecnanism appears to joe a mere general one 



Ises of the Crystals and Atomic Structure 
:oordmates 



The crystals of the invention, and particularly the 
atomic structure coordinates obtained therefrom, have a 
15 wide variety of uses. For example, the crystals 

described herein can be used as a starting material m 
any of the art -known methods of use for receptor and 
non-receptor tyrosine kinases. Such methods of use 
include, for example, identifying molecules that bind to 
20 the native or mutated catalytic domain of tyrosine 

kinases. The crystals and structure coordinates are 
particularly useful for identifying compounds that 
inhibit receptor and non-receptcr tyrosine kinases as an 
approach towards developing new therapeutic agents ( see , 
e • g • / Levitzki and Gazit, 1995). 

The structure coordinates described herein can be 
used as phasing models for determining the crystal 
structures of additional native or mutated tyrosine 
Kinase domains, as well as the structures of co-crystals 
or sue n domains with iigands such as inhibitors, 
sconist s , antagonists, and other molecules The 
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mates, as w-Il as r.odels of the three- 



structure :oordir-. s -«, „ s wc _ ^ 

: - n -sref ron-., can als 



dimensional structures ootamec 



used to aid the elucidation of solution-based structures 
of native or mutated tyrosine Kinase domains, such as 
those obtained via NMR . Thus, the crystals and atomic 
structure coordinates of the invention provide a 
convenient means for elucidating the structures and 
functions of receptor and non- receptor tyrosine kinases. 

For purposes of clarity and discussion, the 
crystals of the invention will be described by reference 
to specific FGFRl exemplary crystals. Those skilled m 
the art will appreciate that the principles described 
herein are generally applicable to crystals of the 
tyrosine kinase domain of any cytoplasmic tyrosine 
kinase that undergoes 1 igand- induced dimerization or 
receptor tyrosine kinase, including but not limited to 
the tyrosine kinases of FIG. 6. 

VII> Structure P^ermi nation fo r PTT ^ wir _ h TTnV 

Structure Uping S tructural P ^ r ^ 

Structural r*r>o v-^ - ^ ^ ~ _ _ . . ^ v ^ 

a^-e^ , sucn as tnose set forth in 

Table 1, Table 2, Table 3, and Table 4, can be used to 

determine the three dimensional structures of PTKs with 

unknown structure. The methods described below can 

apply structural coordinates of * r^i,,™-^ ,.. 

_~ tot;L ^ ^cn as an amino acid 

seauenrp Y- vaw , . , . _ 

^ , - - 7 ^^voud__ ju i ci uii i c airrractior h 



ata, 



ox 



nuclear magnetic resonance (NMR) data. Preferred 
embodiments of the invention relate to determining the 
three dimensional structures of PTKs and related 
polypeptides. These include receptor PTKs such as FGF- 
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ruch as Sr.Z, -RK, 3 T K , CSK, AEL, ZAP7 1, FES , FA:' , JAK , 
ina AC\< can a.sc ce used in the T.ethcds described 
i e r e in 



Structures U sing Amino Ari- 



iomoioay 



Homology modeling is a method of applying 
structural coordinates of a polypeptide of known 
structure to the amino acid sequence of a polypeptide o: 
unknown structure. This method is accomplished using a 
computer representation of the three dimensional 
structure of a polypeptide or polypeptide complex, the 
computer representation of amino acid sequences of the 
polypeptides with known and unknown structures, and 
standard computer representations of the structures of 
amino acids. Homology modeling comprises the steps of 
(a) aligning the amino acid sequences of the 
polypeptides with and without known structure; (bj 
transferring the coordinates of the conserved amino 
acids in the known structure to the corresponding amino 
acids of the polypeptide of unknown structure; refining 
the subsequent three dimensional structure; and (d) 
constructing structures of the rest of the polypeptide. 
One SKiiled m the art recognizes that conserved arr.inc 
acias between two proteins can be determined from the 
sequence alignment step m step (a) . 

The above method is well known to those skilled in 
the art. Greer, 1985, Science 228, 1055. Blundeii ez 
al., 1983, Eur. J. Biochem. 1^2, 513. A computer 
program currently utilized for homology modeling by 
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cnose skilled m one arc is me Hone logy nodule in ^: 
Insight II modeling package distributed by Molecular 
Emulations Inc. 

Alignment c: trie amxi^ a~_^ i>~-.- — -- 
5 accomplished by first placing the computer 

representation of the ammo acid sequence of a 
polypeptide with known structure above the ammo acid 
sequence of the polypeptide of unknown structure. Amine 
acids in the sequences are then compared and groups of 
10 amino acids that are homologous (e.g., ammo acid side 
chains that are similar in chemical nature - aliphatic, 
aromatic, polar, or charged) are grouped together. This 
method will detect conserved regions of the polypeptides 
and account for amino acid insertions or deletions. 

Once the ammo acid sequences of the polypeptides 
with known and unknown structures are aligned, the 
structures of the conserved amino acids m the computer 
representation of the polypeptide with known structure 
are transferred to the corresponding amino acids of the 
polypeptide whose structure is unknown. For example, a 
tyrosine in the amino acid sequence of known structure 
may be replaced by a phenylalanine, the corresponding 
homologous amino acid in the amino acid sequence of 

unknown structure. 

- y .^ _ r .,. u _ ture3 c f amino ar ids located in non- 

censerved regions are to be assigned manually by either 
using standard peptide geometries or molecular 
simulation techniques, such as molecular dynamics. The 

final step in the process ^ c^^nv^s 

the entire structure using molecular dynamics and/or 
energy minimization. 
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ana :.as beer, practiced usir.c 
. m.c- ecu^es . The :nree dinensicna: 
the polypeptide corresponding :c the 
:aca^ytic aomam of a serine /threonine protein kinase, 

:ham protein kinase, was homology modeled 
ie oA-MP- dependent protein kinase catalytic 

Knighton et ai . , 1992, Science 255:130-135. 



ire or c. 



".yes:: 



ures Usma Mol ecular Replarp^p n > 



Molecular replacement is a method of applying one 
A-ray diffraction data of a polypeptide of known 
structure to the X-ray diffraction data of a polypeptide 
of unknown sequence. This method can be utilized to 
define the phases describing the X-ray diffraction data 
of a polypeptide of unknown structure when oniv the 
amplitudes are known. X-PLOR is a commonly utilized 
computer software package used for molecular 
replacement. Brunger, 1992, Nature 355:472-475. AMORE 
is another program used for molecular replacement. 
Navaza, 1994, Acta Crystzallogr . A5 0:157-163 
Preferably, the resulting structure does not exhibit a 
root-mean-square deviation of more than 3 A. 

A goal of molecular replacement is to align the 
positions of atoms in the unit cell by matching electron 
dir traction data from two crystals. A program such as 
X-PLOR can involve four steps. A first step can be to 
determine the number of molecules in the unit cell and 
define the angles between them. A second step can 
involve rotating the diffraction data to define the 
orientation of the molecules in the unit cell. A third 
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seep can be to :rar.sla:e :he elec:r:n aer.s::v i :r/ree 
dimensions :c correctly pos:::on the molecules m the 
unit cell. Once the amplitudes and phases of the X-ray 
diffraction data is determined, an R-f actor can be 
5 calculated by comparing electron diffraction, macs 

calculated experimentally from the reference data set 
and calculated from the new data set. An R-f actor 
between 30-50% indicates that the orientations of the 
atoms in the unit cell are reasonably determined bv this 
10 method. A fourth step in the process can be to decrease 

the R-factor to roughly 20% by refining the new electron 
density map using iterative refinement technioues 
described herein and known to those or ordinary skill in 
the art . 



15 



Structures Us ing NMR Data 

Structural coordinates of a polypeptide or 
polypeptide complex derived from X-ray crystaiiograohic 
techniques can be applied towards the elucidation of 



three dimensional structures of polype 



t — ~ — ^ 



nuclear magnetic resonance (NMR) data. This method is 
used by those skilled in the art. Wuthrich, 1986, John 
Wiley and Sons, New York : 176 -199 ; Pflugrath et al . , 
1986, J. Molecular Biology 189:383-386; Kline et al . , 

25 1986. ,7. Molecular Biology 189: "—3 52. While tne 

secondary structure of a polypeptide is often readily 
determined by utilizing two-dimensional NMR data, the 
spatial connections between individual pieces of 
secondary structure are not as readily determinable. 

30 The coordinates defining a three-dimensional structure 

of a polypeptide derived from X-ray crys t a 1 legraphic 
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:ecnn:ques can guioe :r.e NMR scec t rcsconi s t to ar 
understanding or :nese spatial interactions between 

^ments on a cclvcec t ide of 



secondary structural ^ " o^o- 



related structure 
Th< 



ncwiedge of spatial interactions between 
secondary structural elements can greatly simplify 
Nuclear Overhauser Effect 'NOE' data from two- 
dimensional NMR experiments. Additionally, applying 
crystallographic coordinates after the determination 
secondary structure oy NMR techniques only simplifies 
the assignment of NOSs relating to particular amine 
acids in the polypeptide sequence and does not greatly 
bias the NMR analysis of polypeptide structure. 
Conversely, using the crystallographic coordinates to 
simplify NOE data while determining secondary structure 
of the polypeptide would bias the NMR analysis of 
protein structure . 

As the analysis of polypeptide structure by NMR 
methods is a relatively new technique, the use of 
structural coordinates defining a PTK structure will 
most likely be utilized more frequently in the near 
future. As the method progresses, the three dimensional 
structure analysis of polypeptides of the same size as a 
PTK catalytic domain will become more frequent. 

VIXI - gtruc-ture -Based >s:an of Modulators t>tx 

Funct i on v ^ ; -^- na structur al Coordinate 
Structure-based modulator design and identification 
methods are powerful techniques that can involve 
searches of computer data bases containing a wide 
variety of potential modulators and chemical functional 
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groups. The computerized design and id' 
modulators is useful as one computer data oases contain 
more compounds than the chemical libraries, often by an 
order of magnitude. For reviews of structure -based drua 
5 design and identification see Kuntz et al . , 1994, Ace. 
Chen:. Res. 27:117/ Guida, 1994, Current Opinion in 
Struc, Biol. 4: 777; Colman, 1994, Current Opinion in 
Struo . Biol . 4: 868 . 

The three dimensional structure of a polypeptide 

10 defined by structural coordinates can be utilized by 
these design methods. The structural coordinates of 
Table 1, Table 2, Table 3, and Table 4 can be utilized 
by this method. In addition, the three dimensional 
structures of receptor and non-receptor PTKs determined 

15 by the homology, molecular replacement, and NMR 

techniques described herein can also be applied to 
modulator design and identification methods. Thus, the 
structures of receptor PTKs, FGF-R, PDGF-R, FLK, CCK4 , 
MET, TRKA, AXL , TIE , EPK , RYK , DDR, ROS , RET, LTK, R0R1 , 

20 and MUSK, can be utilized by the methods described 

herein. The structures of non-receptor PTKs, SRC, BRK, 
BTK, CSK, ABL, ZAP 7 0 , FES , FAK, JAK, and ACK, can also 
be utilized by the rational modulator design method. 



Ot>o rri & r-\ r~\ s~\ -v > -t— A ^ — * 1 

Wi^v— IH^^AAW^ w j_ J- (_*. i LUilU^. 

for modulators by docking the computer representation oi 
compounds from a data base of molecules. Publicly 
available data bases include: 



ACD from Molecular Desicms Limited 
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c NCI :roT. l.'sticnal Cancer Ir.s::tu:e 

c '_CDC fron Cartridge Orys:allographic Data Center 

a CAST frcr. Chemical Abstract Service 

e Derwent frcr. Derwent Infcrmation Limited 

f. Mayondge fro- Maybridge Cnemical Company LTD 

g 1 Aldrich from Aldrich Chemical Company 

h) Directory of Natural Products from Chapman & Hall 

One such data base (ACD distributed by Molecular lesions 
Limited Information Systems) contains, for example, 
200,000 compounds that are synthetically derived or are 
natural products. Methods available to those skilled m 
the art can convert a data set represented m two 
dimensions to one represented m three dimensions. 
These methods are enabled by such computer programs as 
CONCORD from Tripos Associates or DB-Converter from 
Molecular Simulations Limited. 

Multiple methods of structure -based modulator 
design are known to those in the art. Kuntz et al . , 
1982, J. Mol. Biol. 162: 269; Kuntz et al . , 1994, 
Acc. Chem. Res. 27: 117; Meng et al . , 1992, J". Compt . 
Chew. 13: 50 5; Bohm, 19 94, J. Comp . Aided Molec. Design 
8: 623. 

A computer program widely utilized by those skilled 
m tne art of rational modulator design is DOCK from the 
University of California in San Francisco. The general 
methods utilized by this computer program and programs 
like it are described m three applications below. More 
detailed information regarding seme of these techniques 
can be found in the Molecular Simulations User Guide, 
19 9 5. 
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A typical computer croara- us^ — - - - , _ 

"■ -^.o purpose 

oar comprise the following steps: 

fa) remove ""^e e >: ^ ^ ^ , „ _ - - 

tix^,^ compouna trcm the crctem- 

(b) dock the strurr^e 

- -* — w d.^Luer compound into 

the acrive-site using :ne computer program (such as ^ 
DOCK, or by interactively moving the compound into the 
active- site; 

(O characterize the space between the compound 
and the active-site atoms; 

(d) search libraries for molecular fragments which 
(Dean fit into the empty space beCwefin th& comDound ^ 

the active-site, and (iiJ caa be ilnked tQ fche compound; 

and 

(e) link the fragments found above to the compound 
and evaluate the new modified compound. 

Part (c) refers to characterizing the geometry and 
the complementary interactions formed between the atoms 
of the active-site and the compounds. a favorable 
geometric fit is attained when a significant surface 

2 0 area i <s aVia-ro^ .. . , 

tne compound and active-site 

atoms without forming unfavorable steric interactions. 

One skilled in the art would note that the method 
can be performed by skipping parts (d) and (e) and 
screening a data base of many compounds. 
25 Structure-based desicm and iHp^^,,..,^ 

- * — ^_ , LWli I. J 1 

modulators of PTK function can be used m conjunction 
with assay screening. As large computer data base of 
compounds (around 10,000 compounds) can be searched m a 
matter of hours, the computer based method can narrow 
the compounds tested as potential modulators of PTK 
function in cellular assays. 
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The aoove des:r;c::cns of structure -cased mcdulatcr 
design are not all etcctpassitg and ether methods are 
reported in the literature : 

CAVEAT: Bart let t et al., 193 9, in "Chemical and 
Biological Problems m Molecular Recognition", Roberts, 
S.M.; Ley, S.V.; Campbell, M.M. eds . ; Royal Society cf 
Chemistry : Cambridge , pel 8 2 - 19 6 . 

[2 i FLOG: Miller et al . , 19 94, J. Comp . Aided 
Molec . Design 5:153 . 

(2; PRC Modulator: Clark et al . , 1995, J. Comv . 
Aided Molec. Design 9:13. 

(4; MCSS : Miranker and Karplus, 1991, Proteins: 
Structure , Function, and Genetics 11:29, 

(5) AUTODCCK: Gcodsell and Olson, 1990, Proteins: 
Structure , Function, and Genetics 5:195. 

(6! GRID: Goodford, 1985, J. Med. Chew. 25:349. 

Pegjqn ioy Moaif vinQ Com pounds in Comp le x with ptks 

Another way of identifying compounds as potential. 

modulators is to modify an existing modulator m the 

polypeptide active-site. For example, the computer 

representation of modulators can be modified within the 

computer representation of a PTK active-site. Detailed 

instructions for this technique can be found in the 

Molecular Simulations User Manual, 1995 in LUDI . The 

computer representation of the modulator is modified by 

tne deletion of a chemical group or groups or by the 

addition of a chemical group or grouos . 

Upon each modification to the compound, the atoms 

of the modified compound and active-site can be shifted 

m conformation and the distance between the modulator 
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and the acc:ve-s::e acorns nay be scored along with any 
complimentary interactions formed between the two 
molecules. Scoring can be complete when a favorable 
creo metric fit and favorable complement arv interactions 
5 are attained. Compounds that have favorable scores are 
potential modulators of PTK function. 

Design by Modifying the Structure of Compounds that 3ind 
PTKs 

10 A third method of structure-based modulator design 

is to screen compounds designed by a modulator building 
or modulator searching computer program,. Examples of 
these types of programs can be found in the Molecular 
Simulations Package, Catalyst. Descriptions for using 

15 this program are documented in the Molecular Simulations 
User Guide (1995) . Other computer programs used in this 
application are ISIS/HOST, ISIS/BASE, IS IS/DRAW) from 
Molecular Designs Limited and UNITY from Tripos 
Associates . 

20 These programs can be operated on the structure of 

a compound that has been removed from the active-site of 
the three dimensional structure of a compound- PTK 
complex. Operating the program on such a compound is 
preferable since it is in a biologically active 

2 5 conformation. 

A modulator construction computer program is a 
computer program that may be used to replace computer 
representations of chemical groups in a compound 

COmp leXeO wilH cx _r ± ix vvx__i_ y _ ^ S whl^ ._. — ^ - — 

30 base. A modulator searching computer program is a 
computer program that may be used to search computer 
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represent:::- of ccmpounus frc- a computer data oase 
:na: -a*^ similar three oimensicnai structures and 
similar chemical groups as compound bound to a PTK . 
A typical program can operate by using the 
5 following general steps: 

>a. map the compounds by chemical features such as 
by hydrogen bond donors or acceptors, 
hydrophobic/lipophilic sites, positively lonizaole 
sites, or negatively lonizable sites; 

10 (b; add geometric constraints to the maoped 

features; and 

(c; search data bases with the model generated in 

(b) . 

Those skilled in the art recognize that for 
15 mdolmcnes, the important chemical features include, 
but are not limited to, a hydrogen bond doner, a 
hydrogen bond acceptor, and two hydrophobic points of 
contact. Those skilled m the art also recognize that 
net ali of the possible chemical features of the 
compound need be present in the model of (b) . One can 
use any subset of the model to generate different models 
for data base searches . 



20 



25 



IX - Organic Synthetic Technique 

The versatility of computer-based modulator design 
and identification lies in the diversity of structures 
screened by the computer programs. The computer 
programs can search data bases that contain 200,000 
molecules and can modify modulators already complexed 
with the enzyme with a wide variety of chemical 
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functional groups. A consequence cf :his cherr.ical 
diversity is that a potential modulator of PTK function 
may take a chemical form that is net predictable. A 
wide array of organic synthetic techniques exist m the 
5 art to meet the challenge of constructing these 

potential modulators of ?TK function. Many of these 
organic synthetic methods are described in detail in 
standard reference sources utilized by those skilled in 
the art. One example of such a reference is March, 

10 1994, Advanced Org anic Chemistry; Reactions. Mechanisms, 

and Structure . New York, McGraw Hill, Thus, the 
techniques required to synthesize a potential modulator 
of PTK function identified by computer-based methods are 
readily available to those skilled in the art of organic 

15 chemical synthesis. 



X. Cellular Assays Measuring the Effect of a PTK 
M odulator in Signal Transduction Pathways 

20 Cellular assays can be used to test the activity of 

a potential modulator of PTK function as well as 
diagnose a disease associated with inappropriate PTK 
activity. A potential modulator of PTK function can be 
tested for activity in vitro by assays that measure the 

25 effect of a potential modulator on the 

aut upliospnor y lat ion of a paiLiuulai PTK uvei - expxessed 
in a cell line, Thus, a modulator that acts as a potent 
inhibitor of the catalytic domain corresponding to a PTK 
would decrease the amount of aut ophosphcry lat ion 

3C catalyzed by that PTK. Potential modulators could also 
be tested for activity m cell growth assays in vitro as 
well as in animal model assays in vivo. 
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:es::r.g the 

b:c3-:v:ty of a pete-:-! -odula:cr designer by the 
"e:hods of the invention. 

Materials, T.etncds, and experimental data for these 
assays are fully described in WO 96/40116 published en 
December 19, 1996, entitled -ndclmone Co-pounds for 
the Treatment of Disease". This application is 
incorporated herein by reference m its entirety, 
including all drawings, figures, and tables. 

XI * Administration Modulator-* cf P~< ^- crmn ^ 

Therapeutics f or DIsp^ sp 

Methods of administering compounds to organisms as 
therapeutics for disease are fully described m WO 
96/4C1I6 published on December 19, 1996, entitled 
"Indclinone Compounds for the Treatment of Disease''. 
This application is incorporated herein by reference m 
its entirety, including all drawings, figures, ana 
tables . 



EXAMPLES 

The examples below are non- limiting and are merely 
representative of various aspects and features of the 
present invention. The examples provide illustrative 
25 methods for obtaining crystalline forms of protein 

kinase polypeptides, methods for determining three 
dimensional structures of these protein kinase 
polypeptides, and methods for identifying modulators of 
protein kinases using the three dimensional structures 
of the protein kinases. 



3C 
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EXAMPLE 1: X-rav Crystal logra chi c ^r;:-::^ 

Dece-ma- ^ of FGFF1 

Polypeptide Synthesis and Isolation 
5 A recombinant baculovirus was engineered to encode 

residues 455 - 765 of human FGFRi . A cieavable N- terminal 
histidine tag was incorporated to aid m protein 
purification. Three ammo acid substitutions were 
introduced: Cys-488 to Ala, Cys-584 to Ser and Leu-457 

10 to Val. The two cysteine substitutions were made to 

prevent the formation of disul f ide - 1 inked oligomers, 
which occurs for the native protein. The substitution 
Leu-457 to Val introduced a Ncol cloning site near Met - 
456. The codon for Tyr-766 (TAG) was changed to a stop 

15 codon (TAG) and a Hindlll- cloning site was generated 

following this stop codon. These substitutions were 
introduced into the full length human cDNA of FGFR1 in 
ml3MPI 9 by s ite - directed mutagenesis according to the 
manufacturer's protocol (Amersham) . 

20 The resulting construct was digested with Ncol and 

Hindlll and was ligated into appropriately digested 
pBlueBac HistagB (Invitrogen) . Transf ection of insect 
cells (Sf9) was performed with the BaculoGold 
transfection system according to the manufacturer's 

25 protocol (Pharmmgen) . Following identification of 
oocitive plaque, *_ he recombinant ioacuioviru^ was 
amplified ro high titer (5xl0 7 virus particles/mi; . sf9 
cells were grown in 175 - cm 2 flasks to a density of 2- 
3xlC 7 per flask and infected with recombinant baculovirus 

30 with a multiplicity of infection (MCI) of 10. 

After 48 hr, cells were harvested by centrif ugation 
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at 3,OG0g for 3 5 mm a: 4 C and :r.er. lysed :r. 2 5 mM 

MgC! 



KtPES -pH 7.5), 150 ttuY XaCl, 10% alvcertl. 1 = -v v 



- % -r::on X-1C0, 10 ng/ml aprctcmr., 10 ag/ml 
leupeptin, and l tr.M pher/ylmethylsulfcnyl fluoride 
5 (P.MS?: . Lysates were cer.trifuged ir. a Scrvai RC 5 0 

(Dupont.. for 1 hr at 4 C C a: 40,000g followed by 
ulcracencrifugaticn m an XL- 8 0 (3eokmar.; at 100,000g 
for 1 hr. After centri f ugation, tne clarified iysate 
was passed over a Ni- -chelating colutr.n (Pharmacia.:, and 
10 tne bound hist idine - tagged fusion protein was eluted 

with 100 mM imidazole (pK 7. 5 ). Pooled fractions were 
loaded onto a Mono Q anion exchange column (Pharmacia; 
and eluted with a NaCl gradient from 0 to 500 mM . 

The fractions containing the fusion protein were 
15 concentrated in a Centricon-30 (Ami ccn; , and the 

histidine tag was removed ky overnight digestion with 
enterokinase (3iozyme) at 2C°C. The digestion was 
terminated by the addition of aprotonin, leupeptin, 
PMSF, TPCK, and bovine pancreatic trypsin inhibitor 
20 (BPTI ) . The cleaved kinase domain was then separated 

from the histidine tag on a Superose 12 size-exclusion 
column (Pharmacia) . The eluted kinase domain was 
further purified on a Mono 2 column. The purified 
kinase domain was analyzed D y N-terrr.inal sequencing and 
25 mass spectrometry. Five amino acids (SAAGT) remained 
from the histidine tag. The predicted molecular mass 
was confirmed by mass spectrometry. 



Crystal Gmwrh 

Purified FGFRl was concentrated to 20-50 mg/ml and 
exchanged into 10 mM Tris-KCl ( pH 3.0). 10 mK NaCl, and 
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2 mM DTT using a Cent r i con - 3 0 . Crystals were grow: 
4°C by vapor diffusion m hanging drops containing 
ul of 10 T.g'nl protein solution and 2.0 ul of rese: 
solution: 16% polyethylene glycol (PEG; 10001, 0.3 
5 (NH.-.SG.. , 5% ethylene glycol, and 100 rr.M bis-Tris 

6.5) . 

Crystals of native FGFRi were soaked m 500 m.l 
stabilizing solution [25% PEG 10000, 0.3 K (NK4),S0 4 , 0.1 
M Bis-Tris (pH 6.5), 5% ethylene glycol] containing 3- 

10 L ( 3 - (2 -carboxy ethyl ) -4 - met hylpyrrol - 5 -yl ) methylene] -2 - 

indolinone (1-5 mMi or 3 - [ 4 - ( 4 - f ormylpiperazme - 1 -yl ) - 
benzylidenyl] -2 -indoiinone (1 mM) at 4°C for 24 to 48 
hours. The final soaking concentration of DMSO was 
between 1 to 5%. The crystals cracked at higher 

15 concentrations of DMSO . 

Co-crystals of FGFR1 with the inhibitors could also 
be obtained by vapor diffusion in hanging droos 
containing 2.0 jxl of 10 mg/mi protein solution and 2.0 
Mi of reservoir solution containing 1 mM 3-[(3-(2- 

20 carboxyethyl) - 4 -met hylpyrrol - 5 -yl ) methylene] -2- 

mdolinone and 3 - [4 - ( 4 - f ormylpiperaz ine - 1 -yl - 
) benzylidenyl] - 2 - indolinone . 

Co-crystals of FGFR1 complexed with AKP-PCP were 
obtained as described for the creation of native 

25 crystals , except "in: the protein solution additionally 

contained 10 mM AM? - PCP and 20 mM MuCl-, . 

Preparation Of Heavy A tom Derivative Crystals 

Heavy atom derivative crystals were octained by 
30 soaking FGFR1 native crystals (C2-A form) in a solution 

containing et hy imercur i thicsal icyl ic acid (thimerosal; , 
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vat a — C;»;lgccicr. and Structure Determination 

For native crystals and crystals comprising the 
nucleotide analog MP-PC?, data were collected either on 
a Rigaku RU-200 rotating anode operated at 50 kV and 100 
mA ;cu Ka) and equipped with double - focusing mirrors and 
an R-AXIS lie image plate detector, or at beamiine X-4A 
at the National Synchrotron Light Source, Brookhaven 
National Laboratory. Syncnrctrcn data (A=l . C7A) were 
collected on Fuji image plates and read with a Fuji 
scanner. One cryc-cooled crystal was used for each of 
the data sets. To obtain cryo-cooied crystals, crystals 
were soaked in a cryo-protectant solution containma 25% 
PEG 10000, 0.3 M (NK 4 ) 2 SO«, 5% ethylene glycol or 
glycerol and 100 mM bis-Tris (pH 6.5), and were flash- 
cooled either in liquid nitrogen directly (synchrotron 
data) or in a dry nitrogen stream at -175°C (rotating 
anode data) . All data were processed using DENZO and 
SCALEPACK. Ctwmowski, 1993, "Oscillation data 
reduction program," Proceedings of the CC?4 Study 
Weekend, Sawyer et al . , eds . (Daresbury, United Kingdom: 
SERC Daresbury Laboratory), 56-62. 

For native crystals and crystals comprising the 
nucleotide analog AMP-PCP, a molecular replacement 
solution was found initially for the C2-B crystal form 
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using an IRK search model :ha: consisted of pclyalanme 
with the cot^od side chains for residues 993 -1263 :fGFR1 
residues 475-754;, excluding residues 1094-1105 'kinase 
insert; and 1153-117 0 (activation loop; . Wich AM ORE 
5 INavaza, 1994, AmoRe : an automated package for molecular 

replacement," Acta Crystallogr. A50: 157-163;, using 80% 
of the structure factor amplitudes between 15.0 and 3.5 
A, one of the two molecules m the asymmetric unit was 
located. The correlation coefficient (c.c.; for the 

10 correct 1-moiecule solution was 0.23 (versus 0.20 for 

the highest incorrect solution} . This molecule was 
rigid body-refined in X-PLOR (Brunger, 1992, X-PLOR 
(Version 3.1) Manual (New Haven, Conneticut: The Howeard 
Hughes Medical Institute and Department of Molecular 

15 Biophysics and Biochemistry, Yale Uiversity) ) , first as 

one rigid body unit, then as two units each comprising a 
lobe of the kinase. Rigid body refinement (12.0-3.5 A, 
F>3o) resulted in a relative rotation of the two lobes 
of -10° and an increase of the c.c. from 0.20 to 0.25. 

20 The rigid body-refined molecule was then used as a new 
search model in AMORE , and this time both molecules in 
the asymmetric unit were located. The c.c. for the 
correct 2-molecule solution was 0.35 (versus 0.27 for 
the highest incorrect solution) . 

25 Multiple cycles r,f rr-. 0 del building and refinement: 

against 6.0-2.4 A data resulted m the addition to the 
model of many of the side chains and some of the missing 
polypeptide chain. Model building was performed using 
TOM/FRODO (Jones, 1985, "Diffraction methods for 

30 biological macromolecuies . Interactive computer 

graphics: FRODO , " Mezhods in Enzymology 115: 157-171; 
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ar.a :orr uga:e-grad:en: r.:r.:r.:za: ^ c*^ - ^ ^ q 
annealing were performed us ma X-FLT. 3 
At: this srage, the P.- value was 3 0% .free R- value of 
36%, . To help expedite model building and refinement, 
experimental phases were obtained. Because crystals 
grown in the presence of ethylene glycol were easier to 
manipulate than those grown m glycerol, several heavy- 
atom derivative data sets were collected from C2-A 
crystals tnat had been soaked in various heavy atom 
solutions. The C2-3 structure was subsequently refined 
against 6.0-2.4 A data to an R-value of 23.8% ;free R - 
value of 30.4%) with r.m.s.d. values of 0.008 A for bond 
distances and 1.4° for bond angles. 

Molecular replacement was used to locate the two 
FGFRl molecules (designated FLGK-A and FLGK-B) m the 
asymmetric unit of the C2-A crystal form. Using AMORE 
with 80% of structure factor amplitudes between 15.0 and 
3.5 A and the C2-3 model, the c.c. for the correct 2- 
moiecuie solution was 0.62 (versus 0.35 for the highest 
incorrect solution) . Heavy atom positions were 
determined from difference Fourier maps using the 
calculated phases from the partial model. Refinement of 
heavy atom parameters and phase determination were 
performed with MLPHARE (Otwinowski , 1991, "Maximum 
likelihood refinement of heavy atom parameters," 
Isomorphous replacement and anomolous Ssattering, Evans 
and Leslie eds . (Darsbury, United Kingdom: SERC 
Daresbury Laboratory), 56-62;). An initial molecular 
isomcrphous replacement (MIR) -phased electron density 
map was calculated with data between 2.0. and 2.8 A 
resolution. This map was improved by solvent 
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flattening, histogram matching, and nor. - cry s call ogr aphi c 
symmetry (NCS averaging using DK (Ccwcan, 1994, 
"Protein Crystallography," CCP4 and ESF-EACBM Newsletter 
(joint) 31: 34 -33). 

Refinement of the C2 -A FGFRl structure against 6.C- 
2.0 A data proceeded by cor.] ugate -gradient minimization 
and simulated annealing using X-PLOR. Tight NCS 
restraints were imposed until data to 2.0 A resolution 
were included m the refinement, at which point the 
restraints were lifted. An overall anisotropic B-vaiue 
was calculated using X-PLOR and applied to the observed 
structure factors, reducing the R-value by -3%. Water 
molecules whose 3-vaiues refined to s7C A 2 were omitted 
from the subsequent refinement round. The average B- 
vaiue is 37.5 A 2 for all protein atoms, 35.4 A 2 for 
protein atoms in FLGK-A, 39.7 A 2 for protein atoms in 
FLGK-B , and 40.2 A 2 for water molecules. The side chains 
for Cys-603 in FLGK-A and FLGK-B and for Met-534 in 
FLGK-B have been modeled in two different conformations. 
Residues that are not included in the atomic model due 
to poor supporting electron density are for FLGK-A: 456- 
463, 486-490, 501-504, 580-591, 763-765; and for FLG-B : 
456-460, 501-504, 578-593, 646-651, 657-659, 762-765. 

The positions of the two AMP-PC? molecules (one per 
FGFRl molecule) wprp p 3 oi iv i -i -e . „ 

— 1 — ' — — - • — - • L11 ^ r . 

Fc.icFOFRn difrerence Fourier maps. Tne AMP-POP molecule 
bound to FLGK-B is less tightly bound and has been 
modeled with an occupancy of 0.5. 

Table A summarizes the X-ray crystallography data 
sets of FGFRl derivative crystals that were used to 
determine the structures of crystalline FGFRl and 
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~F?.l : AM? - 




-plex 












TABLE 


> 










Data Collection and MIR Phasins 


Summary 








Native 


AMP-PCP 




Thj-2 3 


PC M B * 


k' A u if " V 
Lit ^ .\ 


X-ra> source 


X-4A 


RU-200 


RU-200 


RU-200 


R i * ~>nn 


x\U -zuU 


Resolution limit (A) 


2.0 


2.3 


2.6 


2.8 


o 




Number of sites 


— 


— 


4 








Cone. (mVl)/time (h) 






0.1/24 


0.1/48 


0.2/2 


5.072 




4.8(19.7/- 


4.5(23.3) : 


5.5 


9.8 


\J . o 


0 . ft 


Total observations 


122569 


91324 


55456 


59488 


67088 


a5 303 


unique reflections 


50771 


31997 


42820^' 


35538" 


1 86 1 9 


i r*>pp 


Completeness (%) 


97.3(96.3)= 


95.5(93.7) : 


95.0 


96.7 


98.0 


97.7 


Signal (%!>3a) 


80.7(50.3) c 


79.6(51.7)= 


69.8 


66.8 


84.7 


77.6 








17.1 


31.2 


15.4 


15.2 


Phasing power'' 






1.8 


2.0 


1.0 


0.9 








0.55 


0.50 


0.81 


0.84 


Overall FOM h 








0.60 







■Thi-l. Thi-2: ethylmercurithiosalicylic acid (thimerosal); PCMB: 4-chloromercuribenzoic acid 

b R, vm = 100 x EAmhV^h^l/ShSiliCh) 

c Value in parentheses is for the highest resolution shell. 

d I(-h) and l(-h) processed as independent reflections. Anomalous scattering contributions were 
2 5 included. 

f R, sc = 100 x E s i iF P (h)=tF 3 (h)-- ! F PH (h)| F 5 (h) . where F r and F PH are the native and derivative 
structure factors, respectively 

Phasing power: r.m.s. heavy atom structure factor r.m.s. lack of closure (for acentric reflections 
from 20.0 to 2.8A). 

30 -Mollis = 100x2, : F PH (h),-F H . ca . c) (h) /i; h iFp H (h^F P (h)| (for centric reflections from 20.0 to 2. 8A). 

'Figure of merit: ;P(<J»exp(i<|))d<J)/ f P((J))d(<J>), where P is the probability distribution of the phase 
angle 
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For crystals comprising F3FR1 and compounds 1 and 
2, data were collected on a Rigaku RU-200 rotating anode 
(Cu Ka) operating at 5 3 kV and 100 mA and equipped with 
double- focusing mirrors and an R-AXI3 IIC image plate 
5 detector. One cryc-cooled crystal was used for each of 

the data sets. Crystals were soaked m a 
cryo-protectant [25% PEG 10000, 0.3 M (NHJ 2 SO, , 5% 
ethylene glycol, 100 mM bis-Tris (pK 6.5), and 1 mM : 3- 
[ (3- (2-carboxyethyl) - 4 - met hyipyrrol - 5 - y 1 ) me t hy iene ] -2- 

10 indolinone (hereafter referred to as compound 1) or 3 - 

[4 - ( 4 - f ormylpiperazine- 1 -yl - ) benzyl i deny 1 ] - 2 - indolinone 
(hereafter referred to as compound 2) and flash-cooled 
in a dry nitrogen stream at - 1 7 5 0 C . Data were processed 
using DENZO and SCALE PACK. Otwinowski , 19 93, 

15 Proceedings of the CCP4 Study Weekend (Daresbury, United 

Kingdom: SERC Daresbury Laboratory) pp 56-62. 

A summary of the data collection parameters are 
included in the following Table 6 : 

2 0 TABLE 6 





Resolution 
limit (A) 


Observa- 
tions (N) 


Complete- 
ness (%) 


Redundan- 
cy 


*W (%) 


Signal 
(I>al) 


compound 
1 


2.5 


93535 


97.6 (96.1) 


2.7 


6.8 

(23.0) 


1 1.8 


compound 
2 


2.4 




oq i rgj o\ 


~i *> 

_> _> 


6 ^ 

(32.2) 


! I .4 

1 



compound 1 structure; 550 residues, 252 water molecules. 2 compound 1 molecules (4589 atoms) 
compound 2 structure: 550 residues, 248 water molecules, 2 compound 2 molecules (4646 atoms) 

3 0 Structure Analyses 

Atomic superpositions were performed with TOSS 
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surrace Calculations were dene with X-PLC?. . Tie s"-- = --- 
area buried tn a di-er interface was calculated with 
GRASP :Xionolls et al . , 1991' using a probe radius cf 
1.4 A. The stereochemical quality of the atomic -no-del 
was monitored using PRCCHSCK (Laskowski ec al . , 1993, 
FRCCHECK: a computer program to check the stereochemical 
quality of protein structures," J. Appl . Crysc. 26: 2 33 - 
291}. As defined in FRCCHECK, 93% of the residues i- 
che model have main-chain torsion angles in the most 
favored Ramaonandran regions. There are no residues in 
disallowed regions, and three residues m generously 
allowed regions: Arg-622 in FLGK-A and FLGK-B and Arg- 
554 m FLGK-A. The overall G-f actor score is 0.42. 

Table 7 summarizes the X-ray crystallography 
refinement parameters of the structures of crystalline 
FGFR1 and crystalline FGFR1 :AMP-PCP co-complex of the 
invention. Table 8 summarizes the X-ray crystallography 
refinement parameters for the FGFRl / compound complexes. 

TABLE 7 



Refinement Parameters 



FGFRl : 550 residues, 252 water molecules (4589 atoms) 

FGFRl :AMP-PCP: 550 residues. 238 water molecules. 2 AMP-PCP molecules (4638 atoms) 
d-spacings Reflection R-value' R.m.s.d. 



Model 



s 



lN ") ( 0/ °> bonds (A) ansles(') B-values f 

■ (A=) 



FGFRl 



6.0-2.0 42548 21.3 (26.2)' 0.008 1.3 1.6 



FGFRl :AMP-PCP: 6.0-2.3 26729 20.1(27.51-- 0.009 \A 

3 0 'R-value 1 00 x E, F ob ,<h)j - IF ;slc (h) ! Z„iF c5s( h): for reflections with F ob >2o. 
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-For bonded protein atoms 

'Value in parentheses is the free R-vaiue ( Brunger. S 993 ) determine J rrom 5° o of the data 



TABLE 8 



Model 


d-spacmgs (A) 


Reflec- 
tions 


R- 

value : (N) 


bonds ( A) 


angles (°) 


B- 

values' 


compound 
1 


6.0-2.4 


42548 


19.7 
(27.0) k 


0.008 


1.3 


1.6 


compound 


6.0-2.5 


26729 


20.0 
(28. or 


0.008 


1.4 


1 . 7 



•R, )m - 1 00 x S h S ( |l,(h) - 1(h) 0 ! / S h S ; 1(h) 

c Va!ue in parentheses is for the highest resolution shell. 

'R-value = 1 00 x S h \ F 0 (h)i - .F c (h)j| / S h F 0 (h);, where F 0 and F c are the observed and calculated 
structure factors, respectively (F Q > 2s). 
J For bonded protein atoms. 

k Value in parentheses is the free R-value determined from 5% of the data. 

Atomic Structural Coordinates 

Tables 1 and 2 provide the atomic structural 
coordinates of unphosphory lated FGFRI and 

unphosphorylated FGFR1 : AMP-PCP co-complex, respectively. 
In the Tables, coordinates for both of the FGFR1 
molecules of the dimer comprising the asymmetric unit 
are provided. The amino acid residue numbers coincide 
with those used in FIG . 3. 7m zhe first FGFRI molecule 
of the dimer the residue number is preceded by a I, 
i.e. , residue number 464 of the first FGFRI molecule of 
the dimer is denoted by "1464". Tables 3 and 4 provide 
the atomic structural coordinates of FGFRI m complex 
with indolmone compounds found to inhibit FGFRI 
f unc t ion . 
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"Atom Typp" refers :c :r.e element whose coordinates 
are provided. The first letter ir. toe column defines 
t h e e 1 e m e n t 

"A,A - 11 refers to amine acid. 

"2L. — Y and Z " provide tie Cartesian coordinates of 
the element . 

M S" is a thermal factor that measures movement of 
the atom around its atomic center. 

" QCC " refers to occupancy, and represents the 
percentage of time the atom type occupies the particular 
coordinate. OCC values range from 0 to 1, with 1 being 
100% . 

"£RH n or " PRT2 " relate to occupancy, with ?RT1 
designating the coordinates of one atom when m the 
first conformation and PRT2 designating the coordinates 
of the atom when in the second or alternate 
conformation . 

Structural coordinates for FGFR1 may be modified by 
mathematical manipulation. Such manipulations include, 
but are not limited to, crystallographic permutations of 
the raw structure coordinates, f ract ionali zat ion of the 
raw structure coordinates, integer additions or 
subtractions to sets of the raw structure coordinates, 
25 inversion of the raw structure coordinates and any 

combination of the above. 

In addition, the structural coordinates can be 
slightly modified and still render nearly identical 
three dimensional structures. Therefore, a measure of a 
unique set of structural coordinates is the root -mean- 
square deviation of the resulting structure. Structural 



20 



3C 
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25 



3C 



coordinates that render three dimensional structures 
that deviate frox. one another by a root --ear-square 
deviation of less than 1.5 A nay be viewed as identical. 

EXAMPLE 7 : Computer-R^ed Be b i q- ~ f 

FTK Func- i on 

Potential modulators of PTK function were designed 
and identified by operating the prograrr. Catalyst on the 
structure of 3- [ (3- (2- carboxyethyl) -4 -methyipyrrol -5- 
yDmethyiene] - 2 - indoimone . The chexical features 
constraining the search model include a hydrogen bond 
donor, a hydrogen bond acceptor, and two hydrophobic 
points of contact. Approximately 4 0 compounds were 
identified as potential modulators of PTK function using 
this method. 

The compounds identified by the method as potential 
modulators of PTK function were commercially available. 
These compounds were then tested for their ability to 
inhibit the FLK PTK in an enzyme linked immunosorbant 
assay (ELISA) . The method of performing this assay is 
taught in WO 96/40116, entitled "Indolinone Compounds 
for the Treatment of Disease," published on December 19, 
1996, invented by Tang et al., incorporated by reference 
herein in its entirety, including all figures, drawings, 
" vA " abies - FiK 1 specific antibodies can be prepared 
from the following protocol: 

1. Prepare a Tresyl -Act ivated Agarose/Flk - 1 -D column 
by incubating 10 ml of Tresyl -Act ivated Agarose 
with 20 mg of purified GST-Flk-l-D fusion protein 
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^. _ne coiu-n once with F3S . 

3. Block the excess sices on the column wicn 2 v 
glycine fcr 2 Incurs an 4 C C. 

4 . Wash the c c 1 umn w 1 c h ?B S . 

5. Incubate the column wich Rabbit anti-Flk-iD 
production bleed for 2 hours at 4°C. 

6. Wash the column with PBS . 

Zlute antiserum wich 100 rr.M Citric Acid, pK3 . : and 
neutralize the eluate immediately with 2 M Tns, pH 
9.0. 

8. Dialyize the eluate against PBS overnight at 4oC 
with 3 changes of buffer (sample to buffer ratio is 

1:10 0) . 

9. Adjust the diaiyized antiserum to 5% glycerol and 
store at -80°C in small aliquotes . 

The Flk-1 ELISA can include a 2, 2-azmo-bis (3- 
ethylbenz-thiazoline-6-sulfomc acid (ABTS) solution, 
which can comprise lOOmM citric acid (anhydrous), 250 mM 
Na 2 HP0 4 (pH 4.0), 0.5 mg/ml A3TS (Sigma catalog no. A- 
1888) . The solution is most appropriately stored in 
dark at 4°C until ready for use. 

The FLK-1 specific antibodies can also be purchased 
from Santa Cruz Biotechnology (Catalog No: SC-504) . 

Four of the forty compounds identified as potential 
modulators of PTK function were potent modulators of FLK 
tunction. These molecules have tne followinc 
structures : 
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The modulators inhibit the FLK protein kinase with the 
following IC 50 values: 
5 TABLE 9 



Compound 


FLK kinase 


FLK kinase 


EGFR 


IGF-IR 




ic 50 


ic so 


ic 3U 






(MM) 


(MM) 


(^M) 


(wM) 




compounds 


compounds 








tested at lOOuM 


tested at 20/^M 






1 


14.8 


14 


> 100 


>100 


2 


15.7 


10.6 


>100 


>100 


3 


21.4 


16.6 


68 


30.9 


4 


22.9 


16.4 


> 1 00 


i 



The invention illustratively described herein may 
be practiced in the absence of any element or elements, 
15 limitation or limitations which is net specifically 

disclosed herein. The terms and expressions which have 
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been eniplcyed are used as terms of descr;p:ic:; and net 
cf limitation, and there is no intention tnat m tne use 
of such terms and expressions of excluding any 
equivalents of the features shewn and described or 
portions thereof, but it is recognized tnat various 
modifications are possible within the scope of the 
invention claimed. Thus, it should be understood that 
although tne present invention has been specifically 
disclosed by preferred embodiments and optional 
features, modification and variation of the concepts 
herein disclosed may be resorted to by those skilled m 
t:he art, and that such modifications and variations are 
considered to be within the scope of this invention as 
defined by the appended claims. 

Those references not previously incorporated herein 
by reference, including both patent and non-patent 
references, are expressly incorporated herein by 
reference for all purposes. Other embodiments are 
within the following claims. 
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:neral informatio:^ 



H2K 



APPLICANT 



SUGE N ., INCORPORATED 
3 51 Galveston Drive 
Redwood City,. CA 94063 



TITLE OF INVENTION. 



3RYSTAL STRUCTURES OF A 
PROTEIN TYROSINE KINASE 



(ill) NUMBER OF SEQUENCES: 



(iv) CORRESPONDENCE ADDRESS 



(B) 

(C) 
(D) 
(E) 
(F) 



STREET : 

CITY : 
STATE : 
COUNTRY : 
ZIP : 



Lyon & Lyon 

633 West Fifth Street 

Suite 4700 

Los Angeles 

California 

U.S.A. 

90071-2066 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE : 

(B) COMPUTER: 

(C) OPERATING SYSTEM: 

(D) SOFTWARE : 



3.5" Diskette, 1.44 Mb 
storage 

IBM Compatible 
I3M P . C . DOS 5 . 0 
FastSEQ for Windows 2.0 



CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 



id/ in ' j jjk 1 & ; 



To Be Assigned 

Herewith 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
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REGISTRATION NUMBER: 32,327 
REFERENCE DOCKET NUK3EE : 22^/088-PCT 



' xx TELECOMMUNICATION INFO RMA T I G N : 

■'A; TELEPHONE: '213) 489-16CC 

'■ B ; TELEFAX : '213)955- 0440 

;C. TELEX : 57 -3 510 



'2; information for seq ID NO : I : 

l 1 SEQUENCE CHARACTERISTICS: 



tAy LENGTH: 310 ammo acids 

(3) TYPE: 

(") STRANDEDNESS : 

:d) topology : 



a mi no acid 
single 
1 inear 



nrotem 



{11! MOLECULE TYPE: 
(ill 1 HYPOTHETICAL: NO 
fxi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Met Leu Ala Gly Val Ser Glu Tyr Glu Leu Pro Glu Asp Pro Arg Trp 
1 5 10 15" 

31u Leu Pro Arg Asp Arg Leu Val Leu Glv Lvs Pro L^u G 1 y Glu 
20 25 ' 30 

Cys Phe Gly Gin Val Val Leu Ala Glu Ala He Gly Leu Asp Lys Asp 
35 40 45 

Lys Pro Asn Arg Val Thr Lys Val Ala Val Lys Met Leu Lys Ser Asp 
50 55 60 

Ala Thr Glu Lys Asp Leu Ser Asp Leu He Ser Glu Met Glu Met Met 
65 7 ° 75 ao 

Lys Met He Gly Lys His Lys Asn He He Asn Leu Leu Gly Ala Cys 
85 90 95 

Thr Gin Asp Gly Pre Leu Tyr Val He Val Glu Tyr Ala Ser Lvs Gly 
ICC 105 no 

Asn Leu Arg Glu Tyr Leu Gin Ala Arg Arg Pro Pro Gly Leu Glu Tyr 
II 5 120 125 

Cys Tyr Asn Pro Ser His Asn Pro Glu Glu Gin Leu Ser Ser Lys Asp 
13C 135 140 



BNSOOCID: <WO 9607836A2_IA> 



WO 98/07835 



PC TLS97/ 14885 



- eu v ' a - Ser Cy = Ala ivr G_n Val Ala Arg Gly Me: G^u Tvr Leu Ala 

Ser Lys Lys Cys lie His Arg Asp Leu Ala Ala Arg Asn Val Leu Val 
165 " : -, 5 

Thr Glu Asp Asn Val Met: Lys lie Ala Asp Pae Gly Leu Ala Ar- Aso 
180 165 190 

lie His His lie Asp Tyr Tyr Lys Lys Thr Thr Asn Gly Arc Leu Pre 

195 20C 205 

Val Lys Trp Met Ala Pro Glu Ala Leu Phe Asp Arg lie Tyr Thr His 
210 215 220 

Gin Ser Asp Val Trp Ser Phe Gly Val Leu Leu Tro Glu He Phe Thr 
225 230 235 " 240 

Leu Gly Gly Ser Pro Tyr Pro Gly Val Pro Val Glu Glu Leu Phe Lys 
245 250 255 

Leu Leu Lys Glu Gly His Arg Met Asp Lys Pro Ser Asn Cvs Thr Asn 
260 265 270 

Glu Leu Tyr Met Met Met Arg Asp Cys Trp His Ala Val Pro Ser Gin 

275 280 285 

Arg Pro Thr Phe Lys Gin Leu Val Glu Asp Leu Asp Arg He Val Ala 
290 295 300 

Leu Thr Ser Asn Gin Glu 
305 310 



(2; INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 amino acids 

<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

in; MOLECULE TYPE: protein 
(ill/ HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Ser Ala Ala Gly Thr Met Val Ala Gly Val Ser G^u Tyr Glu Leu Pre 
- 5 10 15 

Glu Asp Pro Arg Trp Glu Leu Pro Arg Asp Arg Leu Val Leu Gly Lys 
20 25 30 
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Ser Asc 



Ala .nr GIu Lys Asp Leu Ser Asd L<= 
7 0 ' 



u e ^ <=> > 



.u Met GIu Met Met Lys Met 11 



85 



le Gly Lys His Lys Asn He He 



A sr. 



Leu Leu Gly Ala Cys Thr Gin 

IOC 



Asp Gly Pro Leu Tyr Val H e Val 



Tyr Ala Ser Lys Gly Asn Leu A-rrr 



Arg Hu Tyr Leu Gin Ala Arg Ara 

120 



Gly Leu GIu Tyr Ser Tvr 



13< 



vr Asn Pro Ser His Asn Pre GIu GIu Gl: 



135 



140 



Leu Ser Ser Lys Asp Leu V 



145 



15 0 



al Ser Cys Ala Tyr Gin 



155 



/al Ala Arg Gly 



Met GIu Tyr Leu Ala Ser 



165 



Lys Lys Cys He His Arg Asp Leu Al 



160 



a Ala 



170 



175 



Arg Asn Val Leu Val Thr GIu 



190 



Asp Asn Val Met Lys II 



185 



e Ala Asp Phe 
190 



Gly Leu Ala Arc Asp He His Hi 



195 



s He Asp Tyr Tyr Lys Lys Thr 



200 



Thr 



205 



Asn Gly Arg Leu P 
210 



ro Val Lys Trp Met Ala P 



215 



ro GIu Ala Leu Phe Asp 

220 



Arg He Tyr Thr His Gin Ser Asp Val Trp Ser Phe Gly Val Leu Leu 



23 0 



235 



240 



Trp GIu He Phe Thr Leu Gly Gly Ser Pr 



245 



o Tyr Pro Gly Val Pro VaJ 



250 



255 



GIu GIu Leu Phe Ly 



260 



s Leu Leu Lys GIu 
265 



ily His Arg Met Asp Lys Pro 
270 



Ser 



Msn Cys Thr Asn GIu Leu Tyr Me 

9 ~> ^ 

* - 2 8 0 



t Met Met Arg Asp Cys Trp His 
285 



Ala Val Pro Ser GIr 

290 



n Arg Pro Thr Phe Lys 
295 



Leu Val GIu Asp Lev 

3 00 



Asp Arg He Val Ala L 

305 



u .nr Ser Asn Gin GIu 
31C ^ c 
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INFORMATION FOR SEI ID NO : 3 : 

ii; SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 351 amino acids 

(3! TYPE: amine acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY linear 

(n) MOLECULE TYPE protein 
(ill) HYPOTHETICAL: NO 
(xi ) SEQUENCE DESCRIPTION: 5EQ ID NO : 3 : 

Met Arg Gly Ser His His His His His His Gly Met Ala Ser Met Thr 
1 J 5 10 15 

Gly Gly Gin Gin Met Gly Arg Asp Leu Tyr Asp Asp Asp Asp Lys Asp 

20 25 30 

Pro Ser Ser Arg Ser Ala Ala Gly Thr Met Val Ala Gly Val Ser Glu 

Tyr Glu Leu Pro Glu Asp Pro Arg Trp Glu Leu Pro Arg Asp Arg Leu 
50 55 60 

Val Leu Gly Lys Pro Leu Gly Glu Gly Ala Phe Gly Gin Val Val Leu 
65 70 75 80 

Ala Glu Aia lie biy Leu asp -^V^ ^ S P w - v + 

3 5 9 0 9 5 

Val Ala Vai Lys Met Leu Lys Ser Asp Ala Thr Glu Lys Asp Leu Ser 



Asp Leu lie Ser Glu Met Glu Met Met Lys Met lie Gly Lys His Lys 
115 120 125 

Asn He He Asn Leu Leu Gly Ala Cys Thr Gin Asp Gly Pro Leu Tyr 
130 135 140 

Val He Val Glu Tyr Ala Ser Lys Gly Asn Leu Arg Glu Tyr Leu Gin 
145 150 155 16C 

Ala Arg Arg Pro Pro Gly Leu Glu Tyr Ser Tyr Asn Pro Ser His Asn 

165 17C 175 

Pro Glu Glu Gin Leu Ser Ser Lys Asp Leu Val Ser Cys Ala Tyr Gin 
180 185 190 

Val Ala Arg Gly Met Glu Tyr Leu Ala Ser Lys Lys Cys He His Arc 
195 2C0 205 
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«sp Leu r^^a A_a Arg Asn Val Leu Val Thr Glu Asc Ac- va 1 v e - T 
210 215 22 ; 

:ie Ala Asp Phe Gly leu Ala Arg Asp ;ie His His He Asc Tvr Tv- 

225 23C 2 3 5 " * " Z *! ' 

Lys Lys Tr.r Thr Asn sly Arg Leu Pre Val Lys Trp Met Ala Pro Glu 

245 25c 255 

Ala Leu Phe Asp Arg He Tyr Thr His Gin Ser Asp Vai Trp Ser ?he 

260 26 5 2 7 C 

Gly Vai Leu Leu Trp Glu He Phe Thr Leu Gly Gly Ser Pro Tyr Pre 

21 - 230 2S5 

G.y Val Pro Val Glu Glu Leu Phe Lys Leu Leu Lvs Glu Glv His A-cr 
290 295 300 

Met Asp Lys Pro Ser Asn Cys Thr Asn Glu Leu Tvr Met Met M-t Ara 

3C5 310 315 ' 320 

Asp Cys Trp His Ala Val Pro Ser Gin Arg Pro Thr Phe Lys Gin Leu 



325 

Val Glu Asp Leu Asp Arg He Val Ala Leu Thr 



335 



Ser Asn Gin Glu 
340 345 35 0 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 3 base pairs 

< B > TYPE : nucleic acia 

(C) S TRANDEDNE S S : double 

( D ) TOPOLOGY : 1 inear 

MOLECULE TYPE: cDNA to mRNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

AT G C TAG C AG GGGTCTCTGA GTATGAGCTT CCCGAAGACC CTCGCTGGGA GCTGCCTCGG 6 0 

GACAGACTGG TCTTAGGCAA ACCCCTGGGA GAGGGCTGCT TTGGGCAGGT GGTGTTGGCA 12 0 

GAGGCTATCG GGCTGGACAA GGACAAACCC AACCGTGTGA CCAAAGTGGC TGTGAAGATG 18 0 

TTGAAGTCGG ACGCAACAGA GAAAGACTTG TCAGACCTGA TCTCAGAAAT GGAGATGATG 24 0 

AAGATGATCG GGAAGCATAA GAATATCATC AACCTGCTGG GGGCCTGCAC GCAGGATGGT 3 00 

CCCTTGTATG TCATCGTGGA GTATGCCTCC AAGGGCAACC TGCGGGAGTA CCTGCAGGCC 36 0 

CGGAGGCCCC CAGGGCTGGA ATACTGCTAC AACCCCAGCC ACAACCCAGA GGAGCAGCTC 42 0 
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TCCTCCAAGG ACCTGGTGTC CTGC33CTA Z CAGGTGGCCC GAGGCATGGA 3TATCTGGCC 430 

TCCAAGAA3C G GAT AC A C 3 3 AGAGCTGGCA 3G3AGGAATG TCCTGGTGAC A G A 3 G A 3 AA T 540 

GTGAT 3AAGA TAG CAGACTT ^GGCZTZGCA GG3GACATTC ACCACATCGA 3 T A C T A T AAA 60 Z 

AAG A C AAC G A ACGGCCGACT GCCTGTGAAG TG3ATGGGAC GGGAGGCATT ATTTGACCGG 66 0 

ATCTA3AG33 A CC AGAGTG A TGT3TGGT3T TTC333GTG3 T3GT3TGGGA GATCTT 3ACT 72 0 

C^GGGZGGZT 3 CCC AT AG 33 CGGTGTGCCT GT3GAGGAAC TTTTGAAGCT GCTGAAGGAG 7B0 

GGTCACCGCA TGGACAAGCC CAGTAACTGG ACCAACGAGC TGTACATGAT GATGCGGGAC 84 0 

TGCTGGCAT3 CAGTGCCCTC AC AG AG ACT Z ACCTTCAAGC AGCTGGTGGA AGACCTGGAC 90 0 



933 



CGCATCGTGG CCTTGACCTC 3AACCAGGA3 TAG 

(2) I NFGRMAT I ON FOR SEQ ID NO : 5 : 

(i; SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1056 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(::' MOLECULE TYPE : cDNA 

I Xl.) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

ATGCGGGGTT CTCATCATCA TCATCATCAT GGTATGGCTA GCATGACTGG TGGACAGCAA 6 0 

ATGGGTCGGG ATCTGTACGA CGATGACGAT AAGGATCCGA GCTCGAGATC TGCAGCTGGT 12 0 

ACCATGGTAG CAGGGGTCTC TGAGTATGAG CTTCCCGAAG ACCCTCGCTG GGAGCTGCCT 18 0 

CGGGACAGAC TGGTCTTAGG CAAACCCCTG GGAGAGGGCG CCTTTGGGCA GGTGGTGTTG 24 0 

GCAGAGGCTA TCGGGCTGGA CAAGGACAAA CCCAACCGTG TGACCAAAGT GGCTGTGAAG 3 00 

ATGTTGAAGi CGGACGCAAC AGAGAAAGAC TTGTCAGACC TGATCTCAGA AATGGAGATG 3 6 r. 

AT G AAG AT G A TCGGGAAGCA TAAGAATATC ATCAACCTGC TGGGGGCCTG CACGCAGGAT 4 2 0 

GGTCCCTTGT ATGTCATCGT GGAGTATGCC TCCAAGGGCA ACCTGCGGGA GTACCTGCAG 4 3 C 

GCCCGGAGGC CCCCAGGGCT GGAATACTCC TACAACCCCA GCCACAACCC AGAGGAGCAG 54 0 

CTCTCCTC2A AGGACCTGGT GTCCTGCGCC TACCAGGTGG C C C GAG G C AT 3GAGTATCTG 6 00 

GCCTCCAA 3A AGTGCATACA CCGAGACCTG GChGCC AGGA ATGTCCTGGT GACAGAGGAC 66 0 
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-^vir.^n.^. -GkACGGCCG ACTGGCTGTG AAGTGGATG3 CAC3CGAGGG ATTATT7GAC ^ 8 C 

:oA:C7ACA 3CCACCAGAG T3ATGT3TGG 73TTTCGG33 TGCTCCTGTG GGAGATC7TC 340 



ACTCTGGGCG GCTCCCCATA CCCCGGTGTG GCTGTGGAGG AACTTTTCAA GCTGCTGAAG 

3AGGGTCACC 3 C AT 3G AC AA 33CCAGTAAC T3CACCAAC3 AGCTGTACAT GATGATGCGG 

GACTGCTGGC A7GCAGTGCC CTCACAGAGA CCCACCTTCA AGCAGCTGGT GGAAGACCTG 

GACCGCAT3G TGGCCTTGAC CTCCAACCAG GAG TAG 



90 0 
36 0 
102 0 
1056 
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CP MET 1719 

CG MET 17 1? 

SD MET 1719 

CE MET 1719 

C MET 1719 

0 MET 1719 

N ASP 17 2 0 

CA AS? 17 2 0 

C5 AS? 1720 



ATOM 2 2 61 

, » , -> -> Q O 

ATOM 22 8 3 
ATOM 2234 
ATOM 2 2 8 5 

, -p^.,, TIPS 

ATOM 2 2 8 7 

ATOM 2 2 8 8 

ATOM 2 2 9 0 

ATOM 2 2 91 

ATOM 2 2 92 CG ASP 17 2 0 

ATOM 2293 OD1 ASP 1720 

ATOM 2 2 94 OD2 AS? 17 2 0 

ATOM 2295 C ASP 1720 

ATOM 2296 O ASP 172 0 

ATOM 2297 N LYS 1721 

ATOM 2299 CA LYS 1721 

ATOM 23C0 CB LYS 1721 

ATOM 2301 CG LYS 1721 

ATOM 2302 CD LYS 1721 

ATOM 2303 CE LYS 1721 

ATOM 2 3 04 NZ LYS 1/21 

ATOM 2308 C T -YS 1721 

ATOM 2309 0 LYS 1721 

ATOM 2 310 N PRO 1722 

ATOM 2 311 CD PRO 17 22 

ATOM 2 312 CA PRO 17 22 

ATOM 2313 CB PRO 1722 

ATOM 2314 CG PRO 1722 

ATOM 2315 C PRO 1722 

ATOM 2 316 O PRO 17 2 2 

ATOM 2317 N SER 1723 

ATOM 2319 CA SER 1723 

- - ^ on 17 2 3 

A'l'UM ^j>^ ^ - 

ATOM 1321 OG SER - 1 - - - - 

ATOM 2323 C SER 1723 

ATOM 2324 O SER 1723 

ATOM 2325 N ASN 1724 

ATOM 2327 CA ASN 1724 

ATOM 2328 CB ASN 1724 

ATOM 2329 CG ASN 1724 

ATOM 2330 OD1 ASN 1724 

ATOM 2 331 ND2 ASN 172 4 

ATOM 2334 C ASN 1724 

ATOM 2335 O ASN 1724 

ATOM 2336 N CYS 1725 

ATOM 2338 CA CYS 1725 

ATOM 2339 CE CYS 1725 

ATOM 2340 SG CYS 1725 

ATOM 2 3 41 C CYS 17 2 5 

ATOM 2342 O CYS 1725 

ATOM 2343 N THR 1726 



34 428 -0.459 9.448 I - 0 0 32.33 
33.148 -0.235 10.277 1.00 34.72 
32 454 1.066 10.076 1.00 35.04 
3l'.025 1-447 11.141 1.00 34.06 
25.757 0.470 10 409 1.00 33.14 
35*063 -1.797 9.747 1.00 35.53 
34 896 -2.756 8.991 1-00 35.43 
35.826 -1.843 10.840 1.00 38.65 
36.521 -3.049 11.281 1.00 39.03 

37 659 -2.678 12.237 1 . 00 43.11 

38 743 -1.846 11-569 1.00 46.69 
, 8 587 -1.536 10.364 1.00 54.08 
<9 750 -1.503 12.239 1.00 45.93 
35 580 -4.023 11.972 1.00 38,50 

34 554 -3.617 12.528 1.00 37.73 

35 961 -5.298 11.981 1.00 38.10 
35 ^51 -6.339 12.600 1.00 38.12 
35*^27 -7.733 12.323 1.00 38.20 

34 82^ -8.858 12.825 1 00 38.48 

35 3 75 -10.238 12.543 1.00 37.49 

36 320 -10.691 13.625 1.00 39 11 
a id - — i£7 13 .628 1.00 40.75 

35 092 -6.142 14.0?± x.uu - - 4 
36"l36 -6.032 14 739 1.^0 42.70 
33 .e75 -6 . 082 14.658 1. 00 41.23 
32.347 -6.153 14.019 1.00 38.53 

33 -43 -5.901 16.104 1-00 *1.7i 
32 223 -5.957 16.306 1.00 38.90 
31*679 -5.442 15.016 1.00 34.19 

34 4-18 - 7.079 16.819 1.00 43.96 
34.542 -8.174 16.250 1.00 43.02 

34 915 -6.860 18.028 1.00 46.7b 
35*493 -7.973 18.747 1.00 50.74 
36265 -7.500 19.980 1.00 49.47 

35 400 -7.130 21.035 1.00 53.67 
34 259 -8.782 19.143 1-00 53.24 

33 136 -8.259 19.130 1.00 53.97 

34 443 -10.064 19.426 1.00 56.59 
33 316 -10.899 19 825 1.00 59.55 
32 739 -10.386 21.162 1.00 66.12 
33*824 -10-128 22.213 1.00 71.34 
34.661 -10.990 22.485 1.00 73.38 
33.831 -8.926 22.773 

_ „ „ ^ -in -J 1 "I 



0 0 7 4.19 

nn R7 . 31 



3l'.073 -10.662 18.940 1.00 59.27 

7 f' "32 17. 493 - or> 54.50 

31 8^ -11.203 16.300 1.00 50.89 

31 g27 -9.848 15.576 1.00 50.09 

30 893 -9.833 14.006 1.00 44.81 

32 596 -12.235 15.439 1.00 47.28 
33820 -12.172 15.288 1.00 48.97 

31 863 -13.229 14.950 1.00 42.60 
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Whan is claimed is : 

!• A crystalline form cf a polypeptide 
corresponding to the catalytic domain of a orotem 
tyrosine kinase. 

10 2 - The crystalline forn of claim 1, wherein said 

protein tyrosine kinase is a receptor protein tyrosine 
kinase . 

3. The crystalline form of claim 2, wherein said 
15 receptor protein tyrosine kinase is selected from the 

group consisting of PDGF - R , FLK, CCK4 , MET , TRKA, AXL, 
TIE, EPH, RYK , DDR , ROS , RET , DTK, ROR1 , and MUSK . 

4. The crystalline form of claim 1, wherein said 
20 protein tyrosine kinase is a non-receptor protein 

tyrosine kinase. 

5. The crystalline form of claim 4, wherein said 
non-receptor protein tyrosine kinase is selecr^H from a 

25 group consisting of SRC, BRK , BTK , CSK, ABL, ZAP7G, FES , 

FAK, JAK, and ACK . 

6. The crystalline form of claim 1, comprising 
one or mere heavy metal atoms . 

30 

7. The crystalline form cf claim 1, where::, said 
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crvsta.. 



20 



25 



.e :orr; or c.aiT. 7 , wnerem said 
FGFR is FGFR1 . 



. ^. -ne crystalline forrr. of claim 8, defined by 
atomic structural coordinates set fort" m Table 1 

10. The crystalline form of claim 7, comprising at 
least one compound. 

1-. The crystalline form of claim 10, wherein said 
compound is a nucleotide analog. 

12. The crystalline form of claim 11, wherein said 
nucleotide analog is AMP-PCP. 

13. The crystalline form of claim 12, defined by 
atomic structural coordinates set forth m Table 2 . 

14. The crystalline form of claim 10, wherein said 
compound is an indclinone compound. 

15. The crystalline form of claim 14, wherein said 
indolinone compound has a structure set forth m formula 
I or II: 
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or a pharmaceutical]^ acceptable salt:, isomer, 
metabolite, ester, amide, or prodrug thereof, wherein 
5 (a) A, , A 2 , A 3 , and A 4 are i independently carbon or 

nitrogen; 

(b) R : is hydrogen or alkyl; 

fc) R 2 is oxygen in the case of an oxindolinone or 
sulfur in the case of a tniolmdolmone ; 
10 !d) R-j is hydrogen; 

(e) R. , R 5 , R 6 , and R~ are optionally present and are 
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15 



20 



25 



3C 



eiiner i. ir.de^er.der.t" -.■ = ^",= --=-3 -v-~ a >-- 

:=r.sis::r.g of r.ydroger.. alkyl. alkoxy, aryl, arvioxv 
alkaryl, alkaryloxy, halogen, trihalorr.ethv 1 . SiCR. 

~~ 'o ■ R, 



!0 : NRR ' , SO,-. , SR , N0 2 , NRR ' , OH , CN , C ( 0 ) R , QC CQ.p 



NHC-OJR. 'CK : :,C0 2 R, and CCNRR • or (ii) any two adjacent 
R«, R 5 , Rc. / and R- taken together forrr. a fused ring with 
the aryl portion of the oxmdole -based portion of the 
mdol ir.one ; 

if) R : ' ' R '' < ? V ' R= ' , and R c are each 
independently selected from tne group consisting of 
hydrogen, alkyi, alkoxy, aryl, aryloxy, alkaryl, 
alkaryloxy, halogen, tnhalorr.ethyl , S(0)R, SQ 2 KRRy SOj R, 
SR, N0 2/ NRR', OK, CN, C(0)R, O0(O)R, NHC(0)R, (CH,)..CC,R 
and CONRR ' ■ 

(g) n is C, 1, 2, or 3 • 

(h) R is hydrogen, alkyl or aryl ■ 

(i) R' is hydrogen, alkyl or aryl; and 

(j) A is a five metered heteroaryl ring selected 
frorr. the group consisting of thiophene, pyrrole, 
pyrazole, imidazole, 1 , 2 , 3 - triazole , 1 , 2 , 4- triazoie , 
oxazole, isoxazole, thiazole, isothiazole, furan, 1,2,3- 
oxadiazole, 1 , 2 , 4 -oxadiazole , 1 , 2 , 5 -oxadiazoie , 1,3,4- 
oxadiazole, 1 , 2 , 3 , 4 -oxatriazole , 1 , 2 , 3 , 5 -oxatriazole , 

1.2.3- thiadiazoie , 1 , 2 , 4 - thiadiazole , 1 , 2 , 5 - thiadiazoie , 

1. 3 . 4 - thiadiazoie , 1 , 2 , 3 , 4 - thiatriazole , 1,2,3,5- 
tniatriazoie, and tetrazole, optionally substituted at 
one or more positions with alkyi, alkoxy, aryl, aryloxy, 
alkaryl, alkaryloxy, halogen, trihalomethyl , S(0)R, 
SC,NRR-, SCR, SR, NO 2 , NRR', OH, CN , C(0)R, OC(0)R, 
NHC(0)R, ;CH 2 ;,CC 2 R or CONRR'. 
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16. Tne c r y stallir.e f c rr. or c i a i m . 
indolinone compound is 3 - [ 3 - 2-carboxye: 
met hylpyrroi - 5 - yl ; methylene; - 2 - maounc^ 



5 17. The crystalline for- of claim 15, wherein said 

indolinone compound is 3 - [4 - { 4 - f ormyipiperaz me - 1 - 
yl ) benzyl i deny 1] - 2 - indolinone . 

18. The crystalline form of claim 16, defined by 
10 the atomic structural coordinates of Table 3 . 

19. The crystalline form of claim 17, defined by 
the atomic structural coordinates of Table 4. 



20. The crystalline form of claim 1, having 
monoclinic unit cells. 



21. The crystalline form of claim 20, wherein said 
monoclinic unit ceils have dimensions of about a=208.3 

20 A, b = 57.8 A, c=65.5 A and (5 = 107. 2 5 . 

22. The crystalline form of claim 20, wherein said 
monoclinic unit cells have dimensions of about a=211.6 
A, b=51.3 A, c=6fa.l A and p=107.7°. 

25 

23. The crystalline form of claim 10, comprising 
one or more heavy metal atoms. 

24. A polypeptide corresponding to the catalytic 
30 domain of a protein tyrosine kinase, containing at least 

about 20 amino acid residues upstream of the first 
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20 



25 



30 



glycine m conserved glycine - rich region of tr.e 

catalytic domain, and a: lease about 17 anno acid 
residues downstream of :ne conserved argmme located a: 



cne ^-termma^ oounaary of the cat; 



:omam . 



25. The polypeptide of claim 24, wherein said 
protein tyrosine kinase is a receptor protein tyrosine 



26. The polypeptide of claim 24, wherein said 
protein tyrosine kinase is a non-receptor crotem 
tyrosine kinase. 

27. The polypeptide cf claim 25, wherein said 
receptor tyrosine kinase is selected from the group 
consisting cf FGF-R, PDGF-R, KDR , CCK4 , MET , TRKA , AXL, 
TIE, EPH, RYK, DDR, ROS , RET , LTK , R 0 R1 , and MUSK . 

28. The polypeptide of claim 26, wherein said non- 
receptor kinase is selected from the group consisting of 
SRC, BRK, BTK, CSK, ABL, ZAP 7 0 , FES, FAK , JAK , and ACK . 

29. The polypeptide cf claim 24 having the ammo 
acid sequence shown in SEQ ID NO : 4 . 

30. A method cf using the polyoeot ide of claim 24 
to form a crystal, comprising the steps of: 

(a; mixing a volume of polypeptide solution 
with a reservoir solution; and 

(b) incubating the mixture obtained m step 
(a) over the reservoir solution in a closed container, 
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under co^::;cns suitable for crystal! :zai:c- . 

31. A method of obtaining an FGF receptor tvrcsine 
kinase domain polypeptide m crystalline form, 
comprising the steps of : 

( a ) mixing a volume of polypept ide solution 
with an equal volume of reservoir solution, wherein said 
polypeptide solution comprises 1 mg/m.L to 60 oig/rrl FGF- 
type tyrosine kinase domain protein, 10 t.M to 200 mM 
buffering agent, 0 m.M to 20 mK dithiothrei toi and has a 
pH cf about 5.5 to about 7.5, and wherein said reservoir 
solution comprises 10% to 30% (w/v) polyethylene alycoi, 
0.1 M to 0.5 M ammonium sulfate, 0% to 20% (w/v) 
ethylene glycol or glycerol, 10 mM to 200 mM buffering 
agent and has a pH of about 5.5 to about 7.5; and 

(b) incubating the mixture obtained in step 
(a) over said reservoir solution in a closed container 
at a temperature between 0° and 25° °C until crystals 
form . 

32. The method of claim 31, wherein said 
polypeptide solution comprises about 10 mg/mL FGF 
receptor tyrosine kinase domain, about 10 mM sodium 
chloride, about 2 mM dithiot hre i t ol , about 10 mv Tris- 
KC1 and has a pH of about 8; the reservoir buffer 
comprises about 16% (w/v) polyethylene glycol (MW 
10000/, about 0.3 M ammonium sulfate, about 5% ethylene 
glycol or glycerol, about 100 mM bis-Tris and has a pH 
of about 6.5; and the temperature is about 4°C. 

33. The method of claim 31, wherein said 
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:oT.prises a corr.oc 



25 



3C 



J4 ' A cDNA encod -? FG? receptor tyrosine 



Kinase 

c D X A has t h e n u 



omam protein, wherein a coding strand of tn^ 
:-ieotide sequence of SEQ ID NO ■ 5 



35. A method of determining three dimensional 

structures of protein tyrosine kinases with unknown 

structure comprising the step of applying structural 

10 atomic coordinates set forth in Table 1, Table 2, Table 

3 , or Table 4 . 

36. The method of claim 35, comprising the 
following steps : 

15 (a) aligning a first computer representation 

of an amino acid sequence of a protein tyrosine kinase 
of unknown structure with a second computer 
representation of a protein tyrosine kinase of known 
structure by matching homologous regions of amino acid 
20 sequences of said first computer representation and said 

second computer representation; 

(b) transferring computer representations of 
amino acid structures in said protein tyrosine kinase of 
known structure to computer representations of 
corresponding amino acid structures in said protein 
tyrosine kinase with unknown structure; and 

(c; determining a low energy conformation of 
the protein tyrosine kinase structure resultinc 
steo (b) . 



i f r O TT. 



37. The method of claim 35, comprising the 



BNSOOCID <WO 9e07836A2JA> 



WO 98/07835 



PCTAUS97/14885 



following seeps : 

ia; aligning the positions of atoms m the 
unit cell by matching electron diffraction data from two 
crystals ; and 

5 (b ] determining a low energy conformation of 

the resulting protein tyrosine kinase structure. 

38. The method of claim 35, comprising the 
following steps : 

10 (a) determining the secondary structure of a 

protein tyrosine kinase structure using NMR data; and 
(b) simplifying the assignment of through- 
space interactions of amino acids . 

15 39. The method of any one of claims 35, 36, 37, or 

38, wherein said protein tyrosine kinase with or without 
known structure is a receptor protein tyrosine kinase. 

40. The method of claim. 35, wherein said recent or 
20 protein tyrosine kinase with or without known structure 

is selected from the group consisting of FGF-R, PDGF-R, 
FLK, CCK4 , MET , TRKA, AXL , TIE , EPH, RYK, DDR, ROS , RET, 
LTK, ROR1, and MUSK. 



25 41. The method of anyone of claims 35, 36, 37, or 

38, wherein said protein tyrosine kinase with or without 
known structure is a non-receptor protein tyrosine 
kinase . 



30 42. The method of claim 41, wherein said protein 

tyrosine kinase with or without known structure is 



BNSDOCID: <WO ©e07836A2JA> 



WO 98/07835 



PCm T S97/14885 



15 



20 



25 



.3 J 



D '" ::ec: :rcT ' ~ r - e consisting of SRZ, B?.K, 3TK , 

CSK. A3L , ZA?7 0 , FES, FAK , JAK , and ACK . 

43. A method of identifying a potential modulate 
of protein tyrosine kinase function by docking a 
computer representation of a structure of a compound 
with a computer representation of a structure of a 



vity formed by the active -site 



s f a protein tyrosine 



kinase, wherein said structure of said protein tyrosine 
10 kinase is defined by atomic structural coordinates set 

forth in Table l, Table 2, Table 3, or Table 4. 

44. The method of claim 43, comprising the 
following steps : 

(a) removing a computer representation of a 
compound complexed with a protein tyrosine kinase and 
docking a computer representation of a compound from a 
computer data base with a computer representation of the 
active-site of the protein tyrosine kinase; 

(b) determining a conformation of the complex 
resulting from step (a) with a favorable geometric fit 
and favorable complementary interactions; and 

(c) identifying compounds that best fit said 
active-site as potential modulators of protein tyrosine 
kinase function. 

45. The method of claim 43, comprising the 
following steps: 

(a) modifying a computer representation of 
compound compiexed with a protein tyrosine kinase by the 
deletion of a chemical group or groups or by the 
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addition of a chemical group or groups; 

(bi decerrr.irung a conformation c: the complex 
resulting from step (a) with a favorable geometric fit 
and favorable complementary interactions; and 
5 (c) identifying compounds that best fit the 

protein tyrosine kinase active-site as potential 
modulators of protein tyrosine kinase function. 

46. The method of claim 43, wherein said method 
10 comprises the following steps: 

(a) removing a computer representation of a 
compound complexed with a protein tyrosine kinase; and 

(b) searching a data base for data base 
compounds similar to said compounds using a compound 

15 searching computer program or replacing portions of said 

compound with similar chemical structures from a data 
base using a compound construction computer program. 

47. The method of any one of claims 43, 44, 45, or 
20 46, wherein said protein tyrosine kinase is a receptor 

protein tyrosine kinase . 

48. The method of claim 47, wherein said receptor 
protein tyrosine kinase is selected from the group 

25 consisting of FGF-R, PDGF-R, FLK, CCK4 , MET , TRKA, AXL, 

TIE, EPH, RYK, DDR, ROS , RET, LTK, ROR1 , and MUSK . 

49. The method of anyone of claims 43, 44, 45, or 
46, wherein said protein tyrosine kinase is a non- 
30 receptor protein tyrosine kinase. 
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^i.- _ _^i... 4^, serein saia pro:e:n 

^--^^L_e^ ^r^iTi L.ie group consist ma c 
SRC, 3RK , BTK. CSK, ABL , ZAF7 0 , FES. FAY. , JAK , and ACK . 



51. a potential modulator of protein tyrosine 
kinase function identified by the method of any one of 
claims 43, 44, 45, or 46. 



10 



15 



52. The potential modulator of claim 51, wherein 
said modulator is selected from a computer data base. 

53. The potential modulator of claim 51, wherein 
said modulator is constructed from chemical groups 
selected from a computer data base 



54 . The potential modulator of protein tyrosine 
kinase function of claim 51, wherein said modulator is 
an indoiinone compound of formula I or II: 
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or a pharmaceut ical ly acceptable salt, isomer, 
metabolite, ester, amide, or prodrug thereof, wherein 
(a J A lt A 2 , A 3 , and A. are independently carbon or 
5 nitrogen; 

(b) R, is hydrogen or alkyl; 

(c) R 2 is oxygen in the case of an oxindolinone or 
sulfur in the case of a thiol indoli none ; 

(d) R 5 is hydrogen; 

10 (e) R 4 , R 5 , R 6/ and R 7 are optionally present and are 

either (i) independently selected from the group 
consisting of hydrogen, alkyl, alkoxy, aryl , aryloxy, 
alkaryl, alkaryloxy, halogen, t rihalomethyl , S(0)R, 
S0 2 NRR', S0 3 R, SR, N0 2 , NRR 1 , OH, CN, C(0)R, OC(0)R, 

15 NHC(0)R, (CH 2 ) R C0 2 R, and CONRR 1 or (ii) any two adjacent 

^•i/ ^5 , ^ 0 / arid r 7 taken together form a fused ring with 
the aryl portion of the oxmdole -based portion of the 
indolinone 

(f) R 2 ', R 3 * , R 4 ' , R 5 ' , and R 6 ■ are each 

20 independently selected from the group consisting of 

hydrogen, alkyl, alkoxy, aryl, aryloxy, alkaryl, 
alkaryioxy, halogen, t rihalomethyl , S(0;R, S0 2 NRR ' _ 7 S0 3 R, 
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£?.. KC Z , NRR ' , CH, CI7. Z Z R C-~'Q - ~ - 

and 20NRR 1 ,- 

■ g r. is C- , 1 , 2 , cr 3 ; 

'h) R is hydrogen, alkyl cr aryl ; 

ii) R ' is hydrogen, alkyl or aryl; and 

:j) A is a five membered heceroaryi ring selected 
from the group consisting of thiophene, pyrrole, 
pyrazole, imidazole, 1, 2, 3-triazole, 1,2,4-triazole, 
oxazoie, isoxazole, thiazoie, isothiazole, furan, 1,2,3- 
cxadiazole, 1 , 2 , 4 - oxadi azole , 1 , 2 , 5 - oxadiazcle , 1,3,4- 
cxadiazole, 1 , 2 , 3 , 4 -oxatriazole , 1 , 2 , 3 , 5 - oxatr iazoie , 

1.2.3- thiadiazoie, 1 , 2 , 4 - thiadiazole , 1 , 2 , 5 - thiadiazole , 

1. 3 . 4 - thiadiazole, 1 , 2 , 3 , 4 - thiatriazoie , 1,2,3,5- 
thiatriazole, and tetrazole, optionally substituted at 
one or more positions with alkyl, aikoxy, aryl, aryloxy, 
alkaryl, alkaryloxy, halogen, trihalomethyl , S(0)R, 
S0 2 NRR', SO s R, SR, N0 2 , NRR ' , OH, CN, C(0)R, 0C(0)R, 

NKC ( O ) R , { CH 2 ) n CO,R or CONRR ' . 



20 55. A method of identifying a potential modulator 

of protein tyrosine kinase function as a modulator of 
protein tyrosine kinase function, comprising the 
following steps: 

(a) administering said potential modulator to 

25 cells,- 

(b) comparing the level of protein tyrosine 
kinase phosphorylation between cells not administered 
the potential modulator and ceils administered said 
potential modulator; and 

30 (c) identifying said potential modulator as a 

modulator of protein tyrosine kinase function based on 
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tne difference in the level of protein :yr:s;ne kinase 
phosphory lat ion . 

56 . A method of identifying a potential modulator 
of protein tyrosine kinase function as a modulator of 
protein tyrosine kinase function, wherein said method 
comprises the following steps: 

(a) administering a preparation of said 
potential modulator to cells; 

(b) comparing the rate of cell growth between 
cells not administered the modulator and ceils 
administered the modulator; and 

(c) identifying said potential modulator as a 
modulator of protein tyrosine kinase function based on 
the difference in the rate of cell growth. 

57. A method of treating a disease associated with 
a protein tyrosine kinase with inappropriate activity in 
a cellular organism, wherein sand method comprises the 
steps of : 

(a) administering a modulator of protein 
tyrosine kinase function to the organism, wherein said 
modulator is in an acceptable pharmaceutical 
preparat ion ; and 

(b) activating or inhibiting the protein 
tyrosine kinase function to treat the disease. 

58. The method of any one of claims 55, d6, or 57, 
wherein said protein tyrosine kinase is a receptor 
protein tyrosine kinase. 
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4 64 

~- -j.airr. ;s, A'r.ereir. saia recector 

protein tyrosine kinase is selectee frc~ the grcuc 
containing FGF-R, PDGF-R , ELK , CCK-1 , MET , TRKA , AXL . 
TIE, E?K , RYK, DDR, ROS , RET , LTK , R0R1 , and MUSK. 

SO. The method of any one of claims 55, 56, cr 57, 
wherein said protein tyrosine kinase is a non-receptor 
protein tyrosine kinase. 

61. The method of claim 60, wherein said non- 
receptor protein tyrosine kinase is selected from a 
group consisting of SRC, BRK , BTK, CSK, ABL, ZAP70, FES, 
FAK, JAK, and ACK . 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 



1. Claims: 1-3, 6, 24-25, 27, 30 all partially 

Crystalline form of a polypeptide corresponding to the 
catalytic domain of a receptor protein tyrosine kinase: 
POGF-R. The polypeptide and the method used to form the 
crystal . 



2. Claims: 1-3, 6, 24-25, 27, 30 all partially . 

Crystalline form of a polypeptide corresponding to the 
catalytic domain of a receptor protein tyrosine 
kinase: FLK/KDR. The polypeptide and the method used to form 
the crystal . 



3. Claims: 1-3, 6, 24-25, 27, 30 all partially 

Crystalline form of a polypeptide corresponding to the 

catalytic domain of a receptor protein tyrosine kinase: 

CCK4. The polypeptide and the method used to form the 
crystal . 



4. Claims: 1-3, 6, 24-25, 27, 30 all partially 

Crystalline form of a polypeptide corresponding to the 
catalytic domain of a receptor protein tyrosine kinase: 
MET. The polypeptide and the method used to form the crystal. 

5. Claims: 1-3, 6, 24-25, 27, 30 all partially 

Crystalline form of a polypeptide corresponding to the 

catalytic domain of a receptor protein tyrosine kinase: 

TRKA. The polypeptide and the method used to form the 
crystal . 



6. Claims: 1-3, 6, 24-25, 27, 30 all partially 

Crystalline form of a polypeptide corresponding to the 
catalytic domain of a receptor protein tyrosine kinase: AXL. 
The polypeptide and the method used to form the crystal. 

7. Claims: 1-3, 6, 24-25, 27, 30 all partially 

Crystalline form of a polypeptide corresponding to the 
catalytic domain of a receptor protein tyrosine kinase: TIE. 
The polypeptide and the method used to form the crystal. 
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Claims: 1-3, 6, 24-2^ 








Crystal 1 ine form of 
catalytic domain of 
The polypeptide and 


a polypeptide correspond! nq to the 
a receptor protein tyrosine kinase: 
the method used to form the crystal 


EPH. 


9 . 


Claims: 1-3, 6, 24-25, , 


27, 30 al 1 partial ly 






Crystalline form of a polypeptide corresponding to the 
catalytic domain of a receptor protein tyrosine kinase: 
The polypeptide and the method used to form the crystal 


RYK. 


10. 


Claims: 1-3, 5, 24-25, 


27, 30 all partially 






Crystall ine form of 
catalytic domain of 
The polypeptide and 


a polypeptide corresponding to the 
a receptor protein tyrosine kinase: 
the method used to form the crystal 


DDR. 


11. 


Claims: 1-3, 6, 24-25, 


27, 30 all partially 






Crystalline form of 
catalytic domain of 
The polypeptide and 


a polypeptide corresponding to the 
a receptor protein tyrosine kinase: 
the method used to form the crystal 


ROS. 


12. 


Claims: 1-3, 6, 24-25, 


27, 30 all partially 






Crystalline form of a polypeptide corresponding to the 
catalytic domain of a protein tyrosine kinase receptor: 
The polypeptide and the method used to form the crystal 


RET. 


13. 


Claims: 1-3, 6, 24-25, 


27, 30 all partially 






Crystalline form of a polypeptide corresponding to the 
catalytic domain of a receptor protein tyrosine kinase: 
The polypeptide and the method used to form the crystal 


LTK. 


14. 


Claims; 1-3, 6, 24-25, 


27, 30 all partially 






Crystalline form of a polypeptide corresponding to the 
catalytic domain of a receptor protein tyrosine kinase: 
R0R1. The polypeptide and the method used to form the 
crystal . 




15. 


Claims: 1-3, 6, 24-25, 


27, 30 all partially 






Crystall ine form of 
catalytic domain of 
MUSK. The polypeptide 


a polypeptide corresponding to the 
a receptor protein tyrosine kinase: 
and the method used to form the crystal. 
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16. Claims: 1,4-6, 24, 25, 23, 30 al'l partially. 

Crystalline form of a polypeptide corresponding to the 
catalytic domain of a non-receptor protein tyrosine kinase: 
SRC. The polypeptide and the method usea to form the crystal. 

17. Claims: 1,4-6, 24, 25, 28, 3G all partially. 

Crystalline form of a polypeptide corresponding to the 
catalytic domain of a non-receptor protein tyrosine kinase: 
BRK. The polypeptide and the method used to form the crystal. 

18. Claims: 1,4-6, 24, 26, 28, 30 all partially. 

Crystalline form of a polypeptide corresponding to the 
catalytic domain of a non-receptor protein tyrosine kinase: 
BTK. The polypeptide and the method used to form the crystal. 

19. Claims: 1,4-6, 24, 26, 28, 30 all partially. 

Crystalline form of a polypeptide corresponding to the 
catalytic domain of a non-receptor protein tyrosine kinase: 
CSK. The polypeptide and the method used to form the crystal, 

20. Claims: 1,4-6, 24, 26, 28, 30 all partially. 

Crystalline form of a polypeptide corresponding to the 
catalytic domain of a non-receptor protein tyrosine kinase: 
ABL. The polypeptide and the method used to form the crystal. 

21. Claims: 1,4-6, 24, 26, 28, 30 all partially. 

Crystalline form of a polypeptide corresponding to the 
catalytic domain of a non-receptor protein tyrosine kinase: 
ZAP70. The polypeptide and the method used to form the 
crystal . 



22. Claims: 1,4-6, 24, 26, 28, 30 all partially. 

Crystalline form of a polypeptide corresponding to the 
catalytic domain of a non-receptor protein tyrosine kinase: 
FES. The polypeptide and the method used to form the crystal. 
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23. 



C " ,B: 1 ' 4 " 6 - 24 ' 26 ' *». 30,1, p.rti.11,. 



"ffl^fj o°j a 3 n P o°'T Pt,de "^^ding to the 



24. 



cK" "c^™ % f SjWMd. corresponding to the 



25. 



Claims: 1,4-6, 24 26 ?a in »n . , 

• <£o, ^8, 30 all partially. 



c C a r ta S fnlc n do4°™ f „ P ™ y £ pt, * ! «"™Po-KHn, to the 

26. Claims: l. 24-25 77 ™ n 

«'34 2 !i, 2 e 7 ^j y f"»»» 7-23. 29, 

which is complexed or not r?2! ptor protein tyrosine kinase 
(CONA encod^g?t) and ?he ^VS^S™ 6 ^Wilt 
The cONA encoding FGFR t Jros^i nlse ll inl^Tl]' 

27. Claims: 35-42 completely 

28. Claims: 43-5G 

SJSlnTkJSS"^, 1 POtentii " ™" iu, "P'- Protei" 
concuter ' '^:? n W u *<"° a three dimensional 

29- Claims, 51-54. 57 .,, c0 ^, etely Jnd ^ a]] 
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30. Claims: 55 completely and 58-61 all partially 

Method of identifying a potential modulator of protein 
tyrosine kinase by comparing the level of protein kinase 
phosphorylation hetween cells not administered the potential 
modulator and cells administered said potential modulator. 

31. Claims: 56 completely and 58-61 all partially 

Method of identifying a potential modulator of protein 
tyrosine kinase by comparing the rate of cell growth between 
cells not administered the modulator and cells administered 

the modulator. 
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